Micropeptide’s Brawny Powers

PAGE 595 

Anderson et al. identify a skeletal muscle-specific micropeptide called myoregulin, 
encoded within a long noncoding RNA, that controls skeletal muscle function by 
directly inhibiting the pump activity of the Ca2+-ATPase SERCA. Mice lacking myoregulin
demonstrate improved intracellular calcium handling and exercise performance.



Gut-Level Individuality 

PAGE 583 

The species composition of gut microbiomes can differ markedly between individ- 
uals. Greenblum et al. now uncover extensive variation within individual gut micro- 
bial species in their gene composition and copy number and link these strain-level 
differences to obesity and inflammatory bowel disease. 



Decoding a Long Noncoding Mystery 

PAGE 607 

EBER2 is a highly abundant noncoding RNA with unknown function that is expressed by the Epstein-Barr virus. Lee et al. find that 
EBER2 localizes to specific repeat sites on viral chromatin to facilitate binding of its interacting host transcription factor PAX5 to these 
sites. Recruitment of the EBER2-PAX5 complex depends upon RNA-RNA interactions between EBER2 and nascent transcripts that 
together control the viral lytic cycle and infection. 



An Extracellular Stop to Influenza 

PAGE 631 

Interferon-stimulated genes (ISGs) act in concert to provide a tight barrier against viruses. Using a screen to identify ISGs inhibiting 
late stages of influenza A virus infection, Dittmann et al. find that plasminogen activator inhibitor 1 (PAI-1) blocks maturation of the
viral surface glycoprotein, thus reducing virus spread in the airways. These findings show that the innate immune system, driven by 
type I interferon, uses modulation of the extracellular environment to inhibit viruses. 

Stress Test for RNA 

PAGE 644 

The CCA-adding enzyme monitors the stability of tRNAs and tRNA-like small RNAs. Whereas CCA is added to stable RNAs, CCACCA 
is added to unstable ones to initiate their degradation. Kuhn et al. now characterize how these two scenarios are distinguished. 
Following CCA addition, stable RNAs are ejected, whereas unstable RNAs refold and are subjected to a second round of addition. 
Therefore, RNAs proofread themselves through differential responses to the interrogation of the enzyme. 



Switching Speed with a Squeeze 

PAGE 659 and PAGE 673 

Fast amoeboid cell migration in 3D environments is central to developmental and disease-related processes, such as cancer metastasis.
Liu et al. and Ruprecht et al. now find that changes in the environment, namely the absence of focal adhesions and strong confinement,
trigger the switch in motility behavior. Liu et al. analyze a large set of different slow mesenchymal cells and show that the requirements
for the motility switch are universal. Ruprecht et al. demonstrate that in vivo, during zebrafish development, fluctuations in cortical
contraction forces act in concert with confinement and low adhesiveness, allowing
embryonic progenitor cells to acquire a fast and persistent migratory behavior.



Viruses Gang Up 

PAGE 619 

A central paradigm in virology is that each virus largely behaves as an independent 
infectious unit. Chen et al. now demonstrate that clusters of enteroviruses are pack- 
aged within phosphatidylserine (PS)-enriched vesicles that are nonlytically released 
from cells. Viruses within these vesicles have enhanced infection efficiency, and PS 
lipids serve as cofactors for enterovirus receptors. Clustering viruses within vesicles 
enables multiple viral genomes to be collectively transferred into cells, facilitating 
genetic cooperativity among viral quasispecies and promoting viral replication. 

Curbing Pain

PAGE 759 

Inflammatory pathways stimulate the capsaicin receptor TRPV1, leading to pain hypersensitivity.
Hanack et al. identify a feedback mechanism that keeps TRPV1-mediated pain in check and could
lead to new avenues for analgesic therapy. Release of the neurotransmitter GABA in response to
TRPV1 activity triggers a noncanonical signaling pathway that inhibits only hyperactive TRPV1,
leaving homeostatic pain responses intact.

Catastrophic Cure 

PAGE 686 

Chromothripsis is a catastrophic cellular event, first described in cancer, in which chromosomes are 
shattered and pieced back together imperfectly. McDermott et al. report a remarkable case in which a woman with WHIM syndrome, 
an inherited immunodeficiency disease, was fortuitously cured by a chromothriptic event in a single hematopoietic stem cell that 
deleted the abnormal copy of the disease gene. This cell then took over the bone marrow and restored normal immune function. 

Complement-ing Tumor Suppression 

PAGE 700 

Macrophages have a crucial role in mediating inflammation that contributes to development of cancer. Bonavita et al. report that 
deficiency of PTX3, a soluble mediator that regulates Complement activity, is associated with cancer in humans and causes 
tumor-promoting macrophage recruitment, angiogenesis, and Trp53 mutations in mice. Thus, PTX3 acts as a physiological extrinsic 
oncosuppressor inhibiting tumor generation through modulation of the activity of macrophages. 

A Malignant Energy Switch 

PAGE 715 

AMPK is the master regulator of cellular energy homeostasis, with tumor suppressive activity. Pineda et al. describe a widespread mechanism
by which AMPK is inactivated in cancer. Cancer-specific MAGE-A3/6-TRIM28 E3 ubiquitin ligase is an oncogenic driver that ubiquitinates
and degrades AMPKα1, resulting in downregulation of autophagy and increased mTOR signaling. These findings identify the
mechanism of action of the MAGE-A3/6 cancer-testis antigens and illustrate a regulatory axis for altering cellular metabolism in cancer. 

Chaotic Therapy for Cancer 

PAGE 729 

The ERK kinase has long been thought to be the only substrate of MEK. Tang et al. now find that HSF1, the master regulator of proteotoxic
stress responses, is a new MEK substrate. MEK blockade inactivates HSF1 and provokes proteomic chaos. Tumor cells are
particularly susceptible to proteostasis disruption. In fact, amyloidogenesis induced by MEK inhibition suppresses tumor growth,
suggesting that disruption of the fragile tumor proteostasis may be feasible as a therapeutic strategy.

Inflaming Metabolic Dysfunction 

PAGE 745 

A high-fat diet is associated with several metabolic abnormalities such as hyperglycemia and increased rates of hepatic glucose pro- 
duction. Perry et al. demonstrate that these abnormalities are primarily caused by increased macrophage-derived IL-6 levels in white 
adipose tissue that result in impaired lipolysis and a concomitant increase in acetyl-CoA levels that fuel hepatic glucose production.
Inflammation, thus, is likely the underlying trigger of metabolic dysfunction associated with a high-fat diet. 

Constant Change Is the Mother of All Frustration 

PAGE 785 

Pathogens such as HIV and influenza are highly mutable. Their neutralization requires the generation of antibodies that cross-react with 
different viral strains, which is hard to achieve. Using an in silico approach, Wang et al. find that cross-reactive antibodies occur with
low probability because conflicting selection forces, imposed by the presence of different variants of 
the same antigen, frustrate affinity maturation. Importantly, frustration can be overcome by sequential 
immunization approaches. 

Conformity Quells Resistance 

PAGE 771 

Many diseases that readily evolve drug resistance are caused by cell populations that acquire diverse 
karyotypes or chromosome copy number. Chen et al. find that growth-suppressing stresses only 
serve to increase the heterogeneity of aneuploid populations, causing resistance to emerge. Using 
a fungal pathogen as a model system, the authors propose and test a strategy termed an “evolu- 
tionary trap” to eradicate such populations. One stress is applied to homogenize the population 
via adaptation; a second specifically targets and eliminates the newly dominant karyotype.

Amino Acid Intricacy



As the central bridge connecting nutrient sensing and cellular
activities, mTORC1 has for years drawn much attention in the
cell metabolism field. Its frequent dysregulation in cancer and
the diverse therapeutic potential of the mTOR inhibitor rapamycin
have further spread its acclaim among cancer biologists,
clinicians, and pharmaceutical companies. Although
efforts have yielded an in-depth understanding of the molecular
components that regulate mTORC1 localization and activity
at various steps along the signaling cascade, a key issue
that has not been adequately addressed is the amino acid
specificity in mTORC1 regulation. Does every flavor of amino
acid taste the same to mTORC1? If not, how are they distinguished,
and perhaps more importantly, why does it matter?




Amino Acid Sensing for mTORC1 Regulation. Courtesy of K.L. Guan.



A recent study from the group of Kun-Liang Guan makes a
breakthrough in pursuing this line of inquiry (Jewell et al.,
2015). RagA and RagB are well-characterized essential
regulators of the lysosomal recruitment and activation
of mTORC1. Surprisingly, they find that RagA/B deficiency in
mouse fibroblasts does not paralyze the entire mTORC1
pathway but selectively impairs the ability of mTORC1
to sense leucine and arginine, while its sensitivity to
glutamine remains intact. Together with earlier work
from Michael Hall and colleagues in yeast (Stracka et al.,
2014), the findings highlight the mechanistic differences in
TORC1 activation upon stimulation by different amino acids.
Further investigation reveals the presence of an ADP ribosylation
factor 1 (Arf1)-dependent, Rag-independent pathway
for mTORC1 recruitment to the lysosome, suggesting a
context-specific role, rather than a universal requirement, of
Rag in mTORC1 localization. Although this pathway appears
to be dedicated to glutamine-mediated mTORC1 regulation
and leads to distinct recruitment kinetics compared to its
Rag-dependent counterpart, the relative contribution of
these two pathways to mTORC1 activation in a normal physiological
context remains to be determined. As Jewell et al.
(2015) mention in their paper, glutamine addiction is a feature
shared by a variety of cancer cells. Therefore, it would be
intriguing to explore whether this glutamine-specific sensing
mechanism is particularly important in the cancer
setting and to test its potential as a target for anti-cancer
therapeutic development.

The work in yeast and mouse fibroblasts reinforces the
notion that TORC1 does not see every amino acid in the
same way and suggests that the mechanisms behind this
may be evolutionarily conserved. However, this leaves unresolved
the identity of the direct amino acid sensor. Two
studies, from the groups of Giulio Superti-Furga and David Sabatini,
represent the most recent progress toward this goal (Rebsamen
et al., 2015; Wang et al., 2015). The two groups independently
characterize a member of the solute carrier family
38, SLC38A9, as a lysosomal membrane-resident amino
acid transporter that is required for mTORC1 activation by
certain amino acids. The characterization of the amino acid
binding and transport activity of SLC38A9 is exciting
and certainly in line with the putative role of SLC38A9 as
an amino acid sensor. However, fully supporting this notion
requires direct evidence linking amino acid binding to its
impact on signal transduction. Moreover, it appears that
SLC38A9 has a rather non-specific substrate profile. It is
therefore unclear whether, and if so to what extent,
SLC38A9 might be involved in differential mTORC1 regulation
by various amino acid inputs. In addition to lysosome
membrane-based sensors, the work from Jewell et al.
(2015) and others suggests that differential sensing occurs
even before mTORC1 recruitment to the lysosome. The
molecular basis for this early-stage sensing is yet another
open question. Indeed, as much as we already know
about mTORC1 regulators, we are just getting started in
appreciating the mechanistic intricacy of the amino-acid-sensing
component of mTORC1 signaling.

Mary Frances Lyon (1925-2014)



Mary Lyon was one of the most notable 
geneticists of the 20th century. She is 
renowned for her discovery of X inacti- 
vation, an early example of epigenetic 
gene regulation, but she also made funda- 
mental contributions to the entire field of 
genetics. 

Mary was born on May 15, 1925 in 
Norwich, in the rural east of the United 
Kingdom. She was the first child of 
Clifford James Lyon, a civil servant, and 
Louise Frances Lyon (née Kirby), a former
school teacher. When she was 10, Mary’s
family, which by then included a younger 
brother and sister, moved to Birmingham. 
There, she attended King Edward VI High 
School for Girls and began her scientific 
career, like so many of us, through an 
inspirational teacher who made lessons 
interesting: Mary Udall, whom Mary 
described as having “a clear analytical 
mind.” Mary was fascinated by physics 
and chemistry, but the prize she received 
from an essay competition, four books 
on nature study, ultimately led to her 
becoming a world-class biologist. 

The Second World War helped change 
the status of women, and Mary decided 
to take the unusual step of reading natural 
sciences at the University of Cambridge 
(Girton College), where women took the 
same coursework as men but at that 
time were awarded only titular degrees. 
She focused on zoology and became 
interested in experimental embryology, 
graduating in 1946 with her titular degree.
Rather unusually, she went on to Ph.D.
studies with R.A. Fisher, the Balfour Chair 
of Genetics at Cambridge, who founded 
much of the field of statistics and devel- 
oped analysis methods for early gene 
linkage studies. 

R.A. Fisher was in his fifties— brilliant, 
eccentric, and difficult. He threw out 
many who joined his lab, but he led Mary 
directly to her lifelong study of mouse 
genetics. At the time, there was no 
systematic method of mapping mouse 
genes. With 20 mouse chromosomes, 
Fisher decided that, if he crossed all 
possible combinations of his 21 visible 
mouse mutants, he would likely detect a 
new linkage. Students were each given a 
line of mice carrying five mutations, and 
Mary took on line 18, which included the 



“pallid” mutation. Already an insightful 
experimentalist, Mary was unconvinced 
that her mouse-crossing experiments 
would give her enough data for a Ph.D.,
but she noticed that pallid mice tended 
to tip their heads to one side. She found 
that they were missing otoliths in the inner 
ear and went on to investigate the effects 
of penetrance and to work out how otolith 
absence correlated with postural reflexes. 

Meanwhile, Mary was reading books by 
Conrad Waddington, known as the father 
of epigenetics, and became aware of the 
novel idea that embryonic development 
depends on genes— at a time when the 
exact nature of a gene was unknown. 
R.A. Fisher was not interested in develop- 
mental genetics, and Mary needed facil- 
ities for histology; so as Waddington had 
returned from his wartime post (scientific 
advisor to the Royal Air Force) to become 
Professor of Animal Genetics at the 
University of Edinburgh, Mary moved to 
his department and to a new supervisor, 
Douglas Falconer. 

Waddington and Falconer were impor- 
tant influences on Mary; she chose to 
stay in mouse genetics due to the rele- 
vance to human studies. Waddington 
applied to the Medical Research Council 
(MRC)— which unlike other funders 
awarded equal pay for women— and 
gained postdoctoral funding for Mary to 




Mary Frances Lyon 



stay in Edinburgh with Toby Carter, work- 
ing on the inherited genetic risks from 
exposure to ionizing radiation, an area of 
great concern after WWII. Several novel 
mutants came out of this research, but fa- 
cilities were not available for the amount 
of mouse breeding required, so in 1955, 
the entire mouse group moved to the 
MRC Radiobiological Research Unit at 
Harwell, led by John Loutit. There, Mary 
worked for an astonishingly productive 
period spanning more than 50 years. In 
the late 1950s, much of her research 
centered on the novel chromosome 
translocations from the Edinburgh muta- 
genesis work, working with Carter, Tony 
Searle, and one of the early great cyto- 
geneticists, Charles Ford. 

Mary had known about X-linked mouse
mutants since her Edinburgh days, and 
at Harwell she focused on mottled mu- 
tants, in which females have patches of 
two different coat colors. She noted that 
males either died in embryogenesis 
or had a single coat color. However, one 
mouse grabbed her attention (ironically, 
a spontaneous mutant, not a radiation 
mutant): a mottled male. Mary bred this 
male and realized that, if a mutation had 
occurred when he was an embryo of just 
a few cells, he would be a mosaic of 
mutated and normal X chromosomes. 
Her analysis led to the discovery that 
this could also apply to his mottled 
daughters, who had two types of cells: 
one with an active gene and one without. 

Mary’s knowledge that female mice 
(unlike female humans) need only one 
X chromosome for normal development 
and have sex chromatin in their nuclei 
led her straight to the work of Susumu
Ohno, whom she regarded as an excep- 
tionally creative and gifted scientist, and 
his discovery that the sex chromatin was 
a condensed X chromosome. Mary had 
already considered the idea that only 
one X chromosome is active in females, 
but her analysis of the mottled mouse, 
in the light of Ohno’s discovery, led her 
to publish her X inactivation hypothesis 
in Nature in 1961.

This concept, now often referred to 
as lyonization, was not greeted with 
universal applause. Hans Grüneberg,
a distinguished geneticist who made 
important contributions to understanding
human disease through mouse genetics, 
simply did not believe the X inactivation hy- 
pothesis. Partly, it made no sense to have 
patches of inactivated cells (we now know 
that these are clones from early inactiva- 
tion events), and partly he may have 
thought that Mary was too young and un- 
known to have such a fundamental insight. 
Grüneberg (a clever man, whose papers
are worth re-reading) was 18 years older 
than Mary, well-known, and had been 
made a Fellow of the Royal Society (FRS) 
in 1956. He must have been a formidable 
opponent, but Mary stood her ground. 

In the 1980s, Mary, who only ever 
worked with a small group, became 
involved with working out the human and 
mouse homology maps, alongside Har- 
well and Oxford scientists, including 
John Edwards. This helped to lay the
foundation for the Human Genome 
Sequencing Project 20 years or so later. 
Mary made major advances in under- 
standing the inherited effects of radiation 
damage and seminal contributions to 
many areas of mouse genetics, including 
the t complex, embryo freezing with David 
Whittingham to preserve mouse stocks, 
and some early work with Richard Gard- 
ner in Cambridge on mouse preimplanta- 
tion embryos to determine the timing of 
X inactivation. 

Mary also edited Mouse News Letter 
(MNL) (1956-1970); in the postwar envi- 
ronment, UK and USA mouse geneticists 
in particular had close and long-standing 



friendships and helped each other greatly. 
MNL was an informal publication mailed 
between them and other groups world- 
wide, providing each other with informa- 
tion about novel mutants and linkage 
groups— an important “bulletin board” in 
the pre-Internet world. 

Mary won numerous awards and was 
elected an FRS in 1973 and a Foreign 
Associate of the US National Academy 
of Sciences in 1979 but was never one 
to push herself forward and did not 
receive the other honors she deserved. 
She was Head of Genetics at Harwell 
from 1962 to 1986, when she was glad 
to relinquish the increasing administrative 
burden to her successor, Bruce Catta- 
nach. Under her stewardship, the Ge- 
netics Division had become a world-class 
center and was eventually established as 
the MRC Mammalian Genetics Unit, now 
directed by Steve Brown. Mary continued 
working productively at Harwell for many 
years following her official retirement 
in 1990. In 1998, Cambridge University 
held a degree ceremony at which Mary 
was awarded a full undergraduate de- 
gree. In 2004, the Mary Lyon Centre was 
opened at Harwell, in a fitting tribute to 
her contributions. Rather poignantly, the 
UK Genetics Society named a new medal 
after Mary last year, and though she knew 
of the medal, she did not live to meet its 
first awardee, Loeske Kruuk. Mary was 
diagnosed with Parkinson’s disease a 
few years ago and died peacefully on 
Christmas Day, 2014. 



This sums up the career, but not quite the 
woman. Mary was small and seemingly 
quiet. When one of us (E.M.C.F.) interviewed
for a Ph.D. position in 1983, Pete
Glenister, Mary’s long-time research 
assistant, warned “not to speak in the 
silences”— the silences were Mary’s phe- 
nomenal mind at work, considering the 
data. When the other of us (J.P.) started 
work as a lab head at Harwell in 1978, 
Mary herself carried out the first set of 
ENU mutagenesis injections, just to help 
out. Mary had an extraordinary mind and 
memory and exceptional clarity. Her mild 
manner concealed an indomitable figh- 
ter on research matters where she felt 
strongly. She was a forward thinker and 
would have been thrilled to carry out 
experiments today using the panoply of 
research tools now available in mice. She 
also loved parties, never turning down an 
invitation. She was held in great affection 
by those who knew her, and she will 
be missed— an intellectual colossus, just 
over 5 feet tall. 

RNA: Jack of All Trades and Master of All

Noncoding RNAs have regulatory capabilities that evolution harnesses to fulfill diverse functions.
Lee et al. show that a noncoding RNA from Epstein-Barr virus recruits a host transcription factor 
to silence virus gene expression and propose that it does this through base-pairing with nascent 
viral transcripts. 



As noted years ago by François Jacob, a
broad set of processes that regulate 
gene expression appear to be the product 
of evolutionary tinkering (Jacob, 1977). 
For decades these mechanisms were 
thought to be exclusively protein-driven, 
but, as would be predicted by unfettered 
tinkering, many are now known to involve 
regulatory RNAs. These RNAs employ 
simple yet highly flexible modes of inter- 
action with proteins and other nucleic 
acids to regulate every aspect of gene 
expression and function. In this issue of 
Cell, Lee et al. (2015) from the Steitz laboratory
add a new trick to the repertoire of
regulatory RNAs. The authors examine 
the function of an Epstein-Barr virus 
(EBV) noncoding RNA, EBER2, and, using 
capture hybridization analysis of RNA tar- 
gets (CHART) (Lee et al., 2015 and refer- 
ences therein), find that EBER2 localizes 
to the tandem terminal repeats (TRs) in 
the EBV genome, in the vicinity of where 
the PAX5 host transcription factor binds 
(Arvey et al., 2012). The authors go on to 
show that EBER2 interacts with PAX5, 
albeit indirectly. Based on structure 
predictions, phylogenetic conservation in 
other related gamma herpesviruses, and 
experimental data, they also propose 
that EBER2 forms an 18 bp hybrid with 
intronic TR sequences in viral LMP2 
nascent transcripts. This RNA-RNA interaction
brings the EBER2-associated PAX5
to the vicinity of its DNA binding site to 
enhance repression of LMP genes likely 
through chromatin remodeling (Figure 5 
in Lee et al. 2015). 

This provides a possible answer to the 
long open question regarding the function 
of the abundant EBERs. In that regard, 
several interesting questions are raised 
by the manuscript. Does EBER1 also



interact with PAX5? Indeed, careful in- 
spection of Figure 2B in Lee et al. sug- 
gests that this may be the case. Could 
this explain the small effect of EBER2 
knockdown on PAX5 binding to the TR? 
As the authors themselves ponder— 
what about EBV strains deleted for 
EBER2 (or both EBERs)? It is interesting 
to wonder whether the phenotypes ob- 
served with these strains (and there is 
controversy here) could be partially 
rescued by directly enhancing the PAX5-TR
DNA interaction. These experiments
would address the importance of EBER- 
mediated PAX5 recruitment for EBV repli- 
cation and latency. As interesting as these 
questions are, the model of Lee et al. rai- 
ses even more fascinating possibilities 
with general impact on RNA biology. 

The model proposed in Figure 5 of Lee 
et al. represents a remarkable example 
of the versatile ability of RNAs to build 
complexes required for constitutive and 
regulated gene function. It also raises 
interesting questions. Can EBER2 base 
pair with TR sequences in DNA, which 
would be accessible only when the region 
is transcribed? This scenario is not 
mutually exclusive with base-pairing to 
nascent RNAs, and one could imagine 
how the EBER2 ribonucleoprotein would 
be handed from nascent RNA to DNA to 
bring PAX5 very close to its DNA binding 
site. Given the high density of nascent 
transcripts in many genomic regions, it 
is possible to imagine nascent RNAs as 
nets of binding sites that localize trans- 
activators near their eventual site of ac- 
tion. The ideas provoked by this manu- 
script add one more chapter to the rapidly 
evolving RNA story. 

It is now clear that RNAs participate in 
almost every facet of the biology of cells 



and viruses, and based on their function, 
RNAs have been categorized as protein-coding
mRNAs or noncoding RNAs, which
lack discernable open reading frames. 
Although this division is arbitrary and in 
many cases based on the absence of evi- 
dence, it has been widely used and serves 
as a practical way to organize our rapidly
changing understanding of RNA biology
(Mercer et al., 2009). Excellent comprehensive
reviews on noncoding RNAs (ncRNAs)
and their many functions have been pub- 
lished (Mercer et al., 2009; Guttman and 
Rinn, 2012; Cech and Steitz, 2014). 

Indeed, ncRNAs have many properties 
of adaptable regulators (Figure 1A): (1) 
RNAs, like DNAs, can “read” sequences 
by base-pairing and this ancient mode 
of nucleic acid-nucleic acid recognition 
provides very high specificity with mini- 
mal investment of genetic material. In 
contrast, proteins that “read” nucleic 
acid sequence generally do so by building 
complex binding domains (such as Puf 
proteins) (Wang et al., 2002). Additionally, 
RNAs have a proclivity to form structures 
that enhance base pairing, and their 2′-OH
provides opportunities for hydrogen
bonding. (2) RNAs interact with proteins 
using sequence, chemical modification 
of bases and sugars, and their secondary 
or tertiary structure. (3) RNAs, like pro- 
teins, are modular and can use domains 
or different surfaces within one domain 
to interact with other molecules (Guttman 
and Rinn, 2012). Furthermore, discrete 
interaction domains can be connected to 
form flexible modular scaffolds (Fig- 
ure 1A). The modular nature of RNAs 
and the versatility of each module for 
diverse interactions dramatically expand 
the repertoire of regulatory RNAs and 
explain their exquisite specificity. 

Figure 1. RNA Modularity and Interaction Versatility

(A) The schematic presents an RNA with two interaction modules (I and III) connected via a linker (II), which could be a hybrid linker in cases where I and III are in 
different molecules (e.g., CRISPRs). Each interaction module can interact with a diverse set of types of ligands. It is very likely that RNAs, like proteins, will be 
found to interact with every other type of macromolecule and small molecule present in cells (represented by the “?”) (as already predicted by riboswitches and by
the ability to select for binding to very different ligands in vitro). 

(B) The example discovered in Lee et al. (2015) is presented in which an RNA molecule (EBER2) bridges between a protein (PAX5) and a second RNA
(LMP2). 



A modular RNA code, whereby
discrete interaction domains can be
combined into flexible modular scaffolds
(I-III in Figure 1A) (Guttman and Rinn,
2012; Mercer et al., 2009), makes RNA a 
highly malleable substrate for evolutionary 
tinkering. This has been particularly 
apparent where rapid evolution is required 
as in host-pathogen interactions, such 
as the EBER2-PAX5 interaction. In fact, 
ncRNAs, encoded by both host and path- 
ogen, play important roles in the control of 
the innate and acquired immune systems 
by altering every step of gene expression 
(Cech and Steitz, 2014). An excellent 
example of the modular evolution of RNA 
domains involved in host-pathogen inter- 
actions is provided by flaviviruses, such 
as dengue viruses, which cleave >90%
of the genomes in infected cells to form a
ncRNA derived from the 3′ UTR. Elements
in the 3′ half of the ncRNA are conserved
to serve in regulating translation of these
viruses, but elements in the 5′ half, also
known as the variable region, evolve 
rapidly to counter different components 
of host innate immunity (Bidet and Garcia-Blanco,
2014).



The resourcefulness of partner recog- 
nition by RNAs is exemplified by the 
EBER2:nascent TR:(X):PAX5 ribonucleoprotein
(Figure 1B). EBER2 assembles
with unknown factors (X) and hijacks 
PAX5. Additionally, EBER2 base-pairs 
with nascent RNAs to bring PAX5 in the vi- 
cinity of its DNA binding site— a new twist 
for trans-acting RNA. Small nuclear RNAs
(snRNAs) are known to base-pair with 
nascent transcripts (pre-mRNAs) to 
mediate RNA splicing, as suggested by 
the Steitz group 35 years ago (Lerner 
et al., 1980), and HIV-1 Tat protein binds 
nascent TAR RNAs to recruit the cellular 
transcription factor P-TEFb to the lentivi- 
ral LTR (Wei et al., 1998). The modules 
described by Lee et al. are not new but 
the combination is— tinkering with any 
available part to build a new machine. 
We argue that the versatility of RNA 
makes it exceptionally adept at sampling
many forms and interactions that
can assemble into a diverse array of ma- 
chines, some of which will be selected. 
Whether or not RNA-based machines 
that recognize nascent transcripts are 
widely used is unclear. What is a foregone 



conclusion, however, is that there are 
many RNA-based surprises ahead. 

Migrating cells exhibit distinct motility modes and can switch between modes based on chemical or
physical cues. Liu et al. and Ruprecht et al. now describe how confinement and contractility influ- 
ence motility mode plasticity and instigate a mode termed stable bleb migration in embryonic and 
tumor cells. 



Crawling cell migration arose early in the 
evolution of eukaryotes and is exhibited 
by diverse eukaryotic lineages (Fritz-Lay- 
lin et al., 2010). In multicellular animals, 
cell migration is crucial in development, 
tissue maintenance, and immunity. Cell 
migration gone awry can also contribute 
to disease, for example, during cancer in- 
vasion and metastasis (Friedl and Alex- 
ander, 2011). The migration behaviors of 
animal cells are heterogeneous and differ 
depending on cell type, developmental 
stage, local environment, and disease 
state. In this issue of Cell, studies from 
the Piel group (Liu et al., 2015) and Hei- 
senberg group (Ruprecht et al., 2015) 
uncover how the plasticity of cell mi- 
gration behaviors is impacted by the 
strength of adhesion, physical confine- 
ment (squeezing between two surfaces), 
contractility, and chemical cues. 

To make sense of the heterogeneity of 
migration behaviors, the migration modes 
of single cells have been divided into two 
broad categories— mesenchymal and 
amoeboid — based on cell morphology, 
mechanism of force generation, cytoskel- 
eton organization, and characteristics of 
the cell-substrate interaction (Figure 1) 
(Friedl and Wolf, 2010). The mesenchymal 
mode, exemplified by fibroblasts, is typi- 
fied by slower velocity, irregular shape, 
strong cell-substrate adhesion, promi- 
nent actin stress fibers, and actin-rich 
leading edge structures, including lamelli- 
podia and filopodia (Gardel et al., 2010). 
The amoeboid category is characterized 
by a faster velocity, rounder shape, 
weaker cell-substrate adhesion, and the 
absence of stress fibers (Lämmermann 
and Sixt, 2009). Distinct amoeboid 
motility modes, however, employ different 
leading edge structures. Immune cells 
typically form actin-rich pseudopods at 
their leading edge (Figure 1, pseudopod 
mode), for which protrusion is driven by 
local actin assembly. Other cells— zebra- 
fish primordial germ cells, for example— 
instead form actin-deficient blebs at their 
leading edge, for which protrusion is 
driven in part by contraction at the rear 
by actin and myosin II (Figure 1, bleb 
mode) (Paluch and Raz, 2013). There is 
plasticity between motility modes, and 
cells can switch between modes, depending 
on their environment (Friedl and 
Wolf, 2010). 

To investigate the effects of adhesion 
strength and physical confinement on 
cell migration plasticity, Liu et al. (2015) 
systematically vary the strength of adhe- 
sion and degree of confinement of a fibro- 
blast cell line that normally migrates in 
the mesenchymal mode. They observe 
that, under conditions of low 2D adhesion 
and high 3D confinement (Figure 1), fibro- 
blast cells switch from mesenchymal 
to one of two higher-velocity amoeboid 
motility modes, a so-called mesen- 
chymal-to-amoeboid transition (MAT). 
Cells migrating in the A1 mode have a 
round cell body and small leading edge 
resembling a pseudopod or lamellipod. 
Unexpectedly, cells migrating in the A2 
mode have an unusually long and ellipsoid 
cell body and leading edge. Examination 
of numerous other cell lines shows that 
many favor one mode or the other, but 
some exhibit both modes. Notably, trans- 
formed and tumor cells, as well as leuko- 
cytes, prefer the A2 mode. These findings 
suggest that individual cell lines exhibit 
migration mode plasticity and that adhe- 
sion strength and confinement are key pa- 
rameters in controlling migration behavior. 

At the molecular level, Liu et al. (2015) 
find that the ability of tumor cells to switch 
to the A2 mode depends on diminished 
activity of proteins that contribute to 
cell-substrate adhesion. Moreover, con- 
ditions of high contractility generated by 
myosin II favor the A2 mode, whereas 
conditions of low contractility favor the 
A1 mode. The driving force for the A2 
mode is linked to global retrograde flow 
of actin and myosin II in the central region 
of the cell, whereas the A1 mode is linked 
to retrograde flow only in the leading 
edge. In A2 cells, the result of more global 
cortical flow of actin and myosin II is the 
accumulation of these proteins at the 
cell rear and the formation of a stable 
bleb at the leading edge that is largely 
devoid of actin. Mathematical modeling 
also predicts the appearance of this sta- 
ble bleb in a regime of high contractility. 
In a previous study, cells migrating with 
a stable bleb-like protrusion dependent 
on myosin II function were observed in 
Dictyostelium discoideum treated with quinine, 
although the physiological significance 
remained unclear (Yoshida and 
Inouye, 2001). The Liu et al. (2015) study 
advances this earlier work by examining 
physiologically relevant environmental 
factors that influence migration plas- 
ticity and stable bleb migration and by 
providing a mechanistic description of 
motility. 

In a companion study, Ruprecht et al. 
(2015) examine the migration modes of 
primary germ layer progenitor cells iso- 
lated from zebrafish embryos. They find 
that cells plated on a 2D substrate 
undergo amoeboid blebbing or mesen- 
chymal motility, depending on the germ 
layer origin of the cells and the extracel- 
lular matrix composition of the substrate. 
However, if cells of various origins are 
treated with serum or the serum compo- 
nent lysophosphatidic acid (LPA), they 
switch to a rapid amoeboid motility 
Figure 1. Motility Modes, Influencing Parameters, and the Initiation and Maintenance of 
Stable Bleb Motility 

Motility modes include mesenchymal motility, or various forms of amoeboid motility characterized by 
blebs, pseudopods, or stable blebs. These are influenced by environmental factors, including the strength 
of adhesion to the substrate, or the extent of physical confinement and contractility. The formation of a 
stable bleb is theorized to involve fluctuations in cortical contractility, followed by symmetry breaking and 
the formation of a bleb. Cortical flow of actin and myosin II toward the cell rear then enhances contractility 
in this location, stabilizing the formation of a single bleb at the front and generating force that drives 
migration. 



mode characterized by the formation of a 
single stable bleb, similar to the A2 mode 
described by Liu et al. (2015). Spatial 
confinement of zebrafish cells in the 
absence of serum also induces stable 
bleb formation and migration. Further- 
more, for both LPA-induced and confine- 
ment-induced stable bleb migration, 
myosin II accumulates at the rear of the 
cell, and myosin-II-driven contractile activity 
is required. These results, together 
with those of Liu et al. (2015), indicate 
that stable bleb motility is observed in pri- 
mary cells and cell lines isolated from 
various animal species. 

To understand how cortical contractility 
contributes to stable bleb motility, Ru- 
precht et al. (2015) develop a mathe- 
matical description of this process. Their 
model predicts that local fluctuations in 
cortical contractility at the cell periphery 
are amplified by external cues or physical 
confinement, causing symmetry breaking 
and initial polarization (Figure 1). Cell po- 
larization is then enhanced and stabilized 
by positive feedback between cortical 
flow of actin and myosin II toward the 
cell rear and the formation of a cortical 
contractility gradient that reinforces flow. 
In support of this model, they observe 
coupling of cortical flow of actin and 
myosin to stable bleb migration. Thus, 
the theoretical model and experimental 
evidence suggest that contractility itself 
is sufficient to initiate stable bleb motility. 

They then examine the incidence of sta- 
ble bleb motility in zebrafish embryos in 
response to contractility. Cells expressing 
a constitutively active variant of the Rho 
family G protein RhoA (to enhance cortical 
contractility) exhibit similar shape, migra- 
tion behavior, and cortical actin and 
myosin II flows as stable bleb cells 
in vitro. Moreover, when transferred into 
cell culture, these cells exhibit charac- 
teristics indistinguishable from primary 
germ layer cells undergoing stable bleb 
motility. Notably, at wounding sites in 
embryos, which exhibit high levels of 
contractility, cells not expressing exoge- 
nous RhoA also assume a stable bleb 
motility mode and migrate from the 
wounding site with the rapid speed, directional 
persistence, and characteristics of 
stable bleb cells in vitro. This confirms 
that stable bleb motility initiates at regions 
of high contractility in a developing 
embryo. 

The findings of Liu et al. (2015) and 
Ruprecht et al. (2015) for animal cells, along 
with the previous study of Dictyostelium 
amoeba (Yoshida and Inouye, 2001), sug- 
gest that stable bleb motility is a funda- 
mental motility mode of eukaryotic cells 
and that it operates in a variety of 
contexts. These may include embryonic 
development, where it is speculated that 
this rapid and directionally persistent 
motility enables extrusion of cells from re- 
gions of high contractility and mediates 
long-range cell interactions. Moreover, 
stable bleb motility is prevalent in trans- 
formed and tumor cells subjected to 
confinement, suggesting that it may be a 
fundamental property related to invasion 
and metastasis. Future work will establish 
the detailed mechanisms of force genera- 
tion during stable bleb motility, the contri- 
bution of this process to development 
and tissue maintenance, and the impact 
of this form of migration on cancer and 
other diseases. 

SUMMARY 

Within each bacterial species, different strains may 
vary in the set of genes they encode or in the copy 
number of these genes. Yet, taxonomic character- 
ization of the human microbiota is often limited to 
the species level or to previously sequenced strains, 
and accordingly, the prevalence of intra-species 
variation, its functional role, and its relation to host 
health remain unclear. Here, we present a compre- 
hensive large-scale analysis of intra-species copy- 
number variation in the gut microbiome, introducing 
a rigorous computational pipeline for detecting such 
variation directly from shotgun metagenomic data. 
We uncover a large set of variable genes in numerous 
species and demonstrate that this variation has 
significant functional and clinically relevant impli- 
cations. We additionally infer intra-species com- 
positional profiles, identifying population structure 
shifts and the presence of yet uncharacterized vari- 
ants. Our results highlight the complex relationship 
between microbiome composition and functional 
capacity, linking metagenome-level compositional 
shifts to strain-level variation. 

INTRODUCTION 

The human gut microbiome plays an important role in host meta- 
bolism, immunity, and drug response and has a tremendous 
impact on our health (Iida et al., 2013; Kinross et al., 2011; 
Vijay-Kumar et al., 2010). Numerous comparative studies aiming 
to characterize the contribution of the microbiome to human 
health have already demonstrated marked shifts in the relative 
abundance of various species, genera, or phyla in various dis- 
ease states (Frank et al., 2007; Hoffman et al., 2014; Larsen 
et al., 2010; Turnbaugh et al., 2009). Clearly, however, each 
microbial species represents many different strains that may 
encode considerably different sets of genes and a different 
number of copies of each gene (reflecting, for example, gene de- 
letions and duplication events). Such intra-species variation en- 
dows each strain with potentially distinct functional capacities. 
Studies of individual isolates of cultured species have indicated, 
for example, that strains often differ in virulence (Gill et al., 2005; 
Salama et al., 2000; Solheim et al., 2009), motility (Zunino et al., 
1994), nutrient utilization (Siezen et al., 2010), and drug resis- 
tance (Gill et al., 2005). Accordingly, the true functional potential 
of a microbiome cannot be inferred from species composition 
alone, and species-level comparative analyses may fail to cap- 
ture important functional differences across samples. Recent ef- 
forts to catalog the relative abundance of known strains in human 
microbiome samples (Kraal et al., 2014) may recover some of 
these differences but are limited to sequenced reference ge- 
nomes and are not able to identify novel, yet-to-be-sequenced 
variation. Gene-centric shotgun metagenomic studies, on the 
other hand, may identify genes or pathways that are differentially 
abundant across samples but cannot necessarily attribute these 
shifts to specific species or strains. Specifically, it is often un- 
clear how much of the observed variation in gene composition 
is due to variation in the abundances of species and how much 
is contributed by intra-species variation. Indeed, conflicting re- 
sults have been reported, with trends identified among species 
profiles that are often poorly translated to gene profiles and 
vice versa (Muegge et al., 2011; Turnbaugh et al., 2009). It is 
therefore not yet clear how prevalent gene-level intra-species 
variation is in the human gut, whether such variation is adaptive 
and affects specific functions, and how much of this variation 
has already been captured by reference genomes. 

Some evidence already suggests that variation among strains 
is common in the human gut. Several studies have focused 
specifically on nucleotide-level variation, assessing, for example, 
the prevalence and stability of single-nucleotide polymorphisms 
across numerous metagenomes (Schloissnig et al., 2013) or 
the level of sequence diversity across multiple near-complete 
genomes from two bacterial species variants obtained by 
single-cell sequencing (Fitzsimons et al., 2013). Other studies 
have taken steps to associate sequence variation with gene-level 
differences, identifying, for example, areas of variable coverage 
and the coordinated loss of genes from specific gene families 
within the Streptococcus mitis B6 genome (Human Microbiome 
Project Consortium, 2012) or a diverse array of strain-specific 
adhesion-like protein genes across cultured strains of Meth- 
anobrevibacter smithii (Hansen et al., 2011). Additional studies 
have used extensive manual genomic reconstruction to track 
strain-resolved shifts over time in Actinomycetaceae in the 
relatively low-complexity premature infant gut microbiome 
(Brown et al., 2013); to detect differences related to antibiotic 
resistance, transport, and biofilm formation among three strains 
of Staphylococcus epidermidis (Sharon et al., 2013); or to identify 
the variable presence of genes involved in transport, motility, 
carbohydrate metabolism, and virulence in two distinct strains 
of Citrobacter (Morowitz et al., 2011). These gene-level studies, 
however, mostly report small-scale or anecdotal results, focusing 
on one or a small number of species and often on specific gene 
families. A high-throughput, comprehensive analysis of gene- 
level variation across a large array of species in the human gut 
is therefore needed to more fully appreciate the extent and func- 
tional implications of strain variation in this complex microbiome. 

To address this challenge, here we establish a rigorous and 
robust pipeline to estimate the copy number of each gene in a 
large set of prevalent gut microbial species in a given sample 
directly from metagenomic shotgun data and, furthermore, to 
detect copy-number variation across samples. We carefully cali- 
brate this pipeline to confirm that it can successfully estimate the 
copy number of individual genes in individual species on a large 
scale. Applying this pipeline to 109 metagenomic samples from a 
recent study of the gut microbiomes of healthy, obese, and in- 
flammatory bowel disease (IBD)-afflicted individuals, we esti- 
mate the copy number of more than 4,000 gene groups across 
70 species in each of these samples and demonstrate the pres- 
ence of widespread copy-number variation within many genes in 
many species. We find that specific functions are especially 
prone to copy-number variation, including functions relevant to 
a community lifestyle and adaptation to the gut environment, 
and further detect associations between strain variation and 
host phenotype. Finally, we demonstrate that these copy-num- 
ber estimates can be used both to model the composition of 
known strains within each sample and to offer insight into com- 
plex population structures, suggesting the presence of yet un- 
characterized species variants. 

RESULTS 

A Pipeline for Calculating Genomic Copy-Number 
Estimates in Metagenomic Samples 

We developed a pipeline to confidently detect variation in gene 
content and gene copy number in a large set of prevalent human 
gut microbes directly from metagenomic data (Figure 1 and 
Experimental Procedures). Briefly, this pipeline works as follows. 
Shotgun metagenomic short reads were first aligned to a set of 
reference genomes representing dominant and prevalent gut 
microbiome strains. To account for the potentially multiple ge- 
nomes available for each species in this reference database, ge- 
nomes were grouped into clusters using a previously introduced 
sequence similarity-based method (Schloissnig et al., 2013). 
These clusters represent approximate species-level groups, 
though in some cases may not reflect classical taxonomic divi- 
sions. We used extensive simulations to carefully select align- 
ment parameters and confirmed that, with these parameters, 
reads mapped to the correct region and correct genome cluster, 
whereas reads from genome clusters not represented in our 
reference database remained unmapped (Figure 2A; Figure S1; 
Extended Experimental Procedures). In parallel, gene coding re- 
gions from all reference genomes were annotated with KEGG 
orthology groups (KOs). Reference genomes and KOs with low 
confidence mapping were identified and excluded (Figure S2; 
Extended Experimental Procedures). For each sample, coverage 
across each KO-annotated region in each reference genome 
was calculated, and coverage values across regions corre- 
sponding to the same KO in the same genome cluster were 
summed. We then used the average coverage of 13 single 
copy marker genes, carefully selected for their universality, map- 
ping accuracy, and coverage consistency (Figure S3; Extended 
Experimental Procedures), to convert the calculated coverage of 
each KO in each cluster to a copy-number estimate (Experi- 
mental Procedures). Overall, this process estimated the copy 
number, $V_{kcs}$, of each KO k, in each genome cluster c, detected 
in each sample s (Figure 1). Notably, copy-number estimates 
represent an average across the various genomes associated 
with each cluster that are present in the sample and across the 
potentially multiple genes associated with each KO. We further 
performed an analysis of an extensive synthetic dataset to 
confirm that this scheme accurately recovers species abun- 
dances and copy-number values (Figures S4A and S4B; 
Extended Experimental Procedures). 
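
The normalization step described above can be sketched roughly as follows. This is a minimal illustration, not the pipeline itself: the function name, the dictionary layout, and the toy coverage values are assumptions made here, and the exact procedure is the one given in the Experimental Procedures. The summed coverage of each KO in a genome cluster is divided by the mean coverage of that cluster's single-copy marker genes in the same sample.

```python
from statistics import mean


def estimate_copy_number(ko_coverage, marker_coverage):
    """Estimate copy number for each KO in one genome cluster in one sample.

    ko_coverage: {KO id: summed coverage over all regions annotated with that KO}
    marker_coverage: coverage values of the single-copy marker genes
    Both layouts are hypothetical; they are not taken from the paper.
    """
    if not marker_coverage:
        raise ValueError("at least one marker-gene coverage value is required")
    cluster_abundance = mean(marker_coverage)  # proxy for cluster abundance
    if cluster_abundance <= 0:
        raise ValueError("cluster has no marker coverage in this sample")
    return {ko: cov / cluster_abundance for ko, cov in ko_coverage.items()}


# Toy values (invented): 13 marker genes, each covered 10x.
marker_cov = [10.0] * 13
ko_cov = {"K02013": 20.0, "K02015": 10.0, "K02016": 0.0}
copy_numbers = estimate_copy_number(ko_cov, marker_cov)
```

In this toy case a KO covered at twice the marker depth receives an estimate of two copies, and a KO with no coverage receives zero.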

We applied this pipeline to a dataset of 109 previously collected 
gut metagenomic samples from a Danish/Spanish cohort (Qin 
et al., 2010), mapping in total >2.45 billion 75 bp reads to 235 
reference genomes grouped into 96 genome clusters (Table S1; 
Extended Experimental Procedures). The average coverage 
across the 13 marker genes (a proxy for the abundance of each 
cluster in each sample) varied considerably across clusters and 
between samples (Figures 2B and 2C). To limit any downstream 
analysis to high-confidence copy-number estimates, we therefore 
considered only genome clusters with sufficient coverage in a 
sample (which we term “detectable” clusters; Experimental Pro- 
cedures). We identified a total of 70 clusters that were detectable 
in at least one sample, with an average of 16 detectable clusters in 
each sample (Table S2). Overall, this analysis assigned copy number 
values to ~1.5 million KO-cluster-sample triplets, estimating 
the copy number of thousands of KOs across a large array of 
genome clusters in >100 samples (Table S3). 

This dataset of copy-number estimates provides a first large- 
scale account of gene-level strain variation among organisms 
common to the human gut. Below, we mine this dataset to 
explore neutral and adaptive variation in this highly complex 
ecosystem in a manner that goes beyond species-level com- 
parative analysis. Importantly, this dataset and the pipeline 
described above can serve as a valuable resource for future 
studies of compositional shifts in the human microbiome and 
in other environments, linking metagenome-level differences in 
gene abundance to genome-level variation. 

Identifying Genes with Highly Variable and with 
Set-Specific Variable Copy Number 

Given the copy-number estimates obtained above, we set out to 
identify specific KOs in specific clusters (KO-cluster pairs, or 
KCs) whose copy number varied across samples. Notably, to 
detect variation, we compared the copy number of each KC 
across different samples rather than comparing the estimated 
copy number in any given sample to the copy number in a refer- 
ence genome, avoiding spurious variation predictions that may 
result from annotation errors or bias in the set of reference 
genomes. Clearly, many clusters can be detected in only a 
few samples. To confidently detect copy-number variation, we 
Figure 1. Schematic of Analysis Pipeline 

Reads from metagenomic samples were mapped to KEGG-annotated reference genomes, grouped into species-level genome clusters. The total coverage of 
each KO (KEGG orthology group), k, in each genome cluster, c, in each sample, s, was normalized by cluster abundance to calculate gene copy number $V_{kcs}$. KCs 
(specific KOs in specific genome clusters) whose copy number varied significantly across samples were detected, as well as those whose copy number was 
associated with host state (obesity, IBD). 

See also Figure S3 and Table S3. 



therefore only considered the 40 clusters that were detectable in 
at least 10 samples. 
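
The two filtering steps (keeping only "detectable" clusters in each sample, then keeping only clusters detectable in at least 10 samples) might look like the sketch below. The coverage cutoff `min_coverage` is a placeholder: the actual detectability criterion is defined in the Experimental Procedures and is not reproduced here. Function names and the input layout are likewise illustrative assumptions.

```python
def detectable_clusters(marker_coverage, min_coverage):
    """Return, per sample, the clusters whose marker coverage meets the cutoff.

    marker_coverage: {sample: {cluster: average marker-gene coverage}}
    min_coverage: hypothetical cutoff; the paper's criterion is not given here.
    """
    return {
        sample: {c for c, cov in clusters.items() if cov >= min_coverage}
        for sample, clusters in marker_coverage.items()
    }


def clusters_for_variation_analysis(marker_coverage, min_coverage, min_samples=10):
    """Keep clusters that are detectable in at least `min_samples` samples."""
    counts = {}
    for clusters in detectable_clusters(marker_coverage, min_coverage).values():
        for cluster in clusters:
            counts[cluster] = counts.get(cluster, 0) + 1
    return {c for c, n in counts.items() if n >= min_samples}


# Toy data (invented): cluster A is well covered in all 12 samples,
# cluster B in only 3 of them.
toy = {f"s{i}": {"A": 5.0, "B": 5.0 if i < 3 else 0.1} for i in range(12)}
kept = clusters_for_variation_analysis(toy, min_coverage=1.0)
```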

We first set out to identify KCs that exhibit extreme and 
prevalent variation across samples. Specifically, we calculated 
the level of inter-sample variation in the copy number of each 
KC and defined as highly variable those KCs whose variation 
was at least two standard deviations greater than the average 
variation of all KCs (Experimental Procedures). We used both 
cross-validation analysis and synthetic samples to confirm the 
robustness and accuracy of this approach (Extended Experi- 
mental Procedures; Figure S4C). In total, this analysis detected 
735 highly variable KCs spanning 261 KOs across 38 genome 
clusters (Figure 3; Table S4). The number of highly variable 
KCs in each cluster varied greatly, reaching up to 47 KCs in 
Figure 2. Genome Cluster Statistics 

The mappability, abundance, and prevalence of each genome cluster (representing a species-level group) are shown in three vertically aligned plots. Clusters are 
sorted by their prevalence across samples. 

(A) Cluster mappability, as determined by a large-scale simulation assay measuring the accuracy of mapping reads extracted from the cluster’s genomes to a 
database in which the genome of origin was removed. In this simulation, reads from clusters represented in the reference database by a single genome (marked 
with a dot above the column) are expected to remain unmapped. 

(B) The distribution of each cluster’s abundance across samples, as determined by the average coverage of 13 single-copy marker genes. 

(C) Cluster prevalence (the number of samples in which the cluster was “detectable”) within each host group, shown as a stacked bar plot. 

See also Figures S1 and S2 and Tables S1 and S2. 



the Roseburia intestinalis cluster (representing 4.05% of the KCs 
in this cluster), with an average of 1.79% of the KCs in each cluster 
(Table S5). We found no apparent relationship between the 
amount of variation observed in a cluster and the number of 
reference genomes in the cluster or the prevalence of the cluster 
across samples, but we did observe a tendency toward high 
variation in species from the Firmicutes phylum compared to 
other species (t test, p < 0.05; see also Figure 3). Although the 
majority of highly variable KOs (57.1%) were variable in just 
one cluster, certain KOs were variable across many clusters, 
with some KOs variable in ten or more different clusters. 
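
The "two standard deviations above the average variation" criterion can be illustrated with a short sketch. The measure of inter-sample variation is defined in the Experimental Procedures; here the population variance of copy number across samples is used purely as a stand-in, and the function name and toy data are assumptions.

```python
from statistics import mean, pstdev, pvariance


def highly_variable_kcs(copy_numbers):
    """Flag KCs whose variation is >= mean + 2 SD of the variation of all KCs.

    copy_numbers: {KC id: [copy-number estimate in each sample]}
    The per-KC variation measure (population variance) is an assumption.
    """
    variation = {kc: pvariance(values) for kc, values in copy_numbers.items()}
    all_variation = list(variation.values())
    cutoff = mean(all_variation) + 2 * pstdev(all_variation)
    return {kc for kc, v in variation.items() if v >= cutoff}


# Toy data (invented): ten KCs with a constant copy number and one KC
# that alternates between absent and four copies.
toy = {f"KC_{i}": [1.0, 1.0, 1.0, 1.0] for i in range(10)}
toy["KC_outlier"] = [0.0, 4.0, 0.0, 4.0]
flagged = highly_variable_kcs(toy)
```

Because the cutoff is computed from the distribution over all KCs, only KCs that are extreme relative to the rest of the dataset are flagged; this is why the criterion captures prevalent, large-magnitude variation rather than subtle shifts.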

The analysis above focused on KCs that exhibit extreme variation 
and on KCs that vary greatly across many different samples. 
Variation within other genes, however, may be more subtle and 
may reflect, for example, adaptive variation that can be observed 
in only a small set of samples. We therefore set out to additionally 
identify set-specific variable KCs, wherein the copy number of a 
given KC was relatively constant across most samples but devi- 
ated significantly in a small subset of the samples (Experimental 
Procedures). In this analysis, we further distinguished cases in 
which a KC exhibited a consistently high copy number in this sub- 
set of samples compared to all other samples (set-specific 
increased copy number) from cases in which a KC exhibited a 
consistently low copy number in this subset of samples (set-spe- 
cific decreased copy number) or in which it exhibited increased 
copy number in one subset and decreased in another. As 
Figure 3. A Map of Variable KCs 

A matrix map representing the status of variable KOs (x axis) in each genome cluster (y axis). Colored bars represent variable KCs (highly variable KCs vary widely 
in copy number across all samples, whereas set-specific variable KCs are increased and/or decreased in copy number in only a small subset of the samples), 
while light gray bars indicate KCs with consistent copy number across samples, and KOs not present in a genome cluster are left white. Genome clusters are 
ordered by phylogeny, and KOs are ordered by hierarchical clustering. The bar chart to the right of the map represents the fraction of KOs in each cluster identified 
as variable. Above the map, certain groups of functionally related KOs are highlighted. The 314 KOs uniquely variable in the E. coli cluster (the majority of which 
have only been annotated in E. coli-like genomes) were excluded due to space constraints. 

See also Figure S4 and Tables S4-S6. 



expected, we found that set-specific variable KCs were much 
more common than highly variable KCs. In total, our analysis de- 
tected 5,004 set-specific variable KCs covering 1,859 KOs 
across the 40 genome clusters examined (Figure 3; Table S4). 
In general, we observed more cases of set-specific increased 
copy number than of set-specific decreased copy number, but 
this ratio shifted markedly across clusters, and in certain clusters 
(i.e., Clostridium sp., Streptococcus thermophilus) mostly 
set-specific decreased KCs were observed. 
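
To make the distinction between the three set-specific categories concrete, the following sketch labels a single KC from its per-sample copy numbers. It is only an illustration of the idea: the statistical test actually used is described in the Experimental Procedures, and the median-based rule, the thresholds `min_deviation` and `max_fraction`, and the function name are all assumptions introduced here.

```python
from statistics import median


def classify_set_specific(values, min_deviation=1.0, max_fraction=0.25):
    """Label one KC as 'increased', 'decreased', 'both', or None.

    values: copy-number estimates of the KC across samples.
    min_deviation, max_fraction: hypothetical thresholds, not from the paper.
    """
    center = median(values)
    high = [v for v in values if v - center >= min_deviation]
    low = [v for v in values if center - v >= min_deviation]
    n_deviating = len(high) + len(low)
    # Require a deviating subset that is non-empty but small.
    if n_deviating == 0 or n_deviating > max_fraction * len(values):
        return None
    if high and low:
        return "both"
    return "increased" if high else "decreased"


# Toy profiles (invented): 18 samples near one copy plus two deviating samples.
increased = classify_set_specific([1.0] * 18 + [3.0, 3.1])
decreased = classify_set_specific([1.0] * 18 + [0.0, 0.0])
mixed = classify_set_specific([1.0] * 18 + [0.0, 3.0])
constant = classify_set_specific([1.0] * 20)
```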

Detected Variation Captures Both Known and Novel 
Strain Variation 

As validation of our pipeline and results, we compared the set of 
highly variable KCs obtained for each cluster to known variation 
among the cluster’s sequenced reference genomes. Clearly, the 
reference genomes in our database do not capture the full extent 
of intra-species variation in the gut microbiome. Similarly, our 
samples likely do not include much of the variation present in 
our reference genomes, as many of these reference genomes 
represent strains isolated from clinically distinct individuals, 
phenotypically diverse cohorts, or non-gut samples. Accordingly, 
a large number of genes that vary in copy number across refer- 
ence genomes may still exhibit consistent copy number across 
the gut samples analyzed above. Yet, the set of detected highly 
variable genes, which aims to include genes that vary frequently 
in their copy number across genomes, is likely to capture many 
instances of known variation in gene content among available 
reference genomes. Indeed, considering the 15 multiple-genome 
clusters in our database, a striking 81% of the detected highly variable 
KCs also vary in copy number across reference genomes 
(Figure 4). Moreover, in seven of these clusters, all highly variable 
KCs also vary in copy number across reference genomes. 
Notably, six of these clusters contain at least three genomes, 
whereas the majority of the other clusters contain only two, sug- 
gesting that more sequenced strains may be needed to fully 
capture the variation associated with these clusters (and more 
importantly, with clusters for which only a single genome was 
available). Importantly, we demonstrated that a similar overlap 
can be observed when comparing predicted variation to known 
variation among a large collection of genomes not included in 
our database, confirming that this overlap is not an artifact of 
the specific reference genomes used in our analysis (Figures 4B 
and 4C; Extended Experimental Procedures). Comparison of 
set-specific variable KCs to known variation across reference ge- 
nomes again confirmed that the variation detected greatly over- 
lapped with known variation observed across sequenced strains 
(Figure S5). Notably, however, set-specific variable KCs also 
included many instances of novel variation, suggesting that the 
set of reference genomes currently available does not capture 
the full extent of copy-number variation in the gut. Comparison 
of detected set-specific variation to variation observed across 
two manually assembled Citrobacter strains further revealed sig- 
nificant overlap (Extended Experimental Procedures). 
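
The overlap statistic reported in this section (the fraction of detected variable KCs that also vary among reference genomes) amounts to a set intersection, sketched below. The function names, the input layout, and the toy values are illustrative assumptions; a KC is treated here as "known variable" when its copy number differs between at least two reference genomes of the cluster.

```python
def varies_in_references(ref_copy_numbers):
    """Return KCs whose copy number is not identical across reference genomes.

    ref_copy_numbers: {KC id: [copy number in each reference genome]}
    """
    return {kc for kc, counts in ref_copy_numbers.items() if len(set(counts)) > 1}


def fraction_known(detected, ref_copy_numbers):
    """Fraction of detected variable KCs that also vary across references."""
    if not detected:
        raise ValueError("no detected KCs to compare")
    known = varies_in_references(ref_copy_numbers)
    return len(detected & known) / len(detected)


# Toy cluster (invented): four detected KCs, three of which vary in references.
refs = {"K1": [1, 2], "K2": [0, 1, 1], "K3": [1, 1], "K4": [2, 0], "K5": [1, 1]}
detected = {"K1", "K2", "K3", "K4"}
overlap = fraction_known(detected, refs)
```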

Functions Associated with Variable Genes 

We examined whether the detected copy-number variation was 
associated with specific functions in each genome cluster. We 
first used enrichment analysis to identify functions that were 
Figure 4. Comparison of Highly Variable KCs to Known Variation among Reference Genomes 

(A) In each Venn diagram, the gray circle represents the set of all KCs in a given genome cluster, the pink circle represents the fraction of those KCs exhibiting 
copy-number variation across the cluster’s reference genomes, and the red circle represents the set of KCs detected as highly variable. Overlap of the pink and 
red circles indicates correspondence between known and detected variation. Each diagram is labeled with the cluster ID, representative species name, and 
number of reference genomes. 

(B and C) Additional variation in reference genomes that were not used as mapping targets is represented by either an orange circle (additional reference ge- 
nomes from IMG) or a yellow circle (additional reference genomes from NCBI), compared to variation in included reference genomes (pink) and detected highly 
variable KCs (red). 

See also Figure S5. 



over-represented among the set of highly variable KCs in each 
cluster. We found that transport-related functions were over- 
whelmingly prone to high copy-number variation (Table S6). Spe- 
cifically, ten of the genome clusters analyzed were enriched for 
variation in KCs associated with transport annotations, including 
the general BRITE term “Transporter,” as well as more specific 
modules related to either sugar or iron complex transport. For 
example, within the Bacteroides ovatus cluster, seven of the clus- 
ter’s 66 transport-associated KCs were highly variable (Figure 5), 
including all three KCs (K02013, K02015, K02016) involved in a 
specific iron complex transport system module (M00240). Inter- 
estingly, significant variation in sugar transport functions was 
only found among clusters in the phyla Firmicutes and Actino- 
bacteria, whereas Bacteroidetes clusters were uniquely associ- 
ated with variation in the iron complex transport system (see 
Table S6). Studies of cultured organisms from various environ- 
ments and experimental evolution assays have suggested that 
loss, amplification, and acquisition of transport functions consti- 
tute a primary adaptive mechanism (Gevers et al., 2004; Heikki- 
nen et al., 2007; Lee and Marx, 2012; Sonti and Roth, 1989); 
here, we show that this flexibility in the copy number of transport 
genes likely extends to a considerable proportion of prevalent gut 
species and that, within this general class, specific transport 
genes may facilitate adaptation to the gut environment. 

We additionally found that motility-related KCs were highly 
variable in the Eubacterium rectale genome cluster. Specifically, 



in this cluster, 7 of the 38 highly variable KCs were bacterial 
motility proteins, of which four were structural flagellar compo- 
nents, two were involved in chemotaxis, and one was essential 
for twitching motility (Han et al., 2008). Motility proteins, and especially
flagellar proteins, are widely associated with virulence and
immunostimulation, and the gain or loss of flagellar components 
is believed to be an important adaptive mechanism (Borziak et al., 
2013; Heikkinen et al., 2007; Al Mamun et al., 1997). Moreover, 
variation in these seven KCs was highly consistent within sam- 
ples; most samples contained either detectable copies of all 
seven KCs or no (or low number of) copies of all of these KCs (Fig- 
ure S6). Though we found no variation in the copy number of any 
of these genes among the three sequenced reference genomes 
included in the Eubacterium rectale cluster in our database, a 
recent study of 27 elderly gut metagenomes identified non-uni- 
form coverage of genes involved in the flagellum biogenesis path- 
ways of six Eubacterium and Roseburia species (Neville et al., 
2013), suggesting that the current reference genomes may not 
capture the full dynamic range of these species. 

Next, we considered the collection of set-specific variable 
KCs and examined their functional annotations. Interestingly, hi- 
erarchical clustering of set-specific variable KOs based solely on 
their variation profile across the 40 clusters revealed dis- 
tinct groups of functionally related genes that vary in a given 
genome cluster or within multiple clusters (Figure 3). For 
example, a large set of genes related to cell growth and 

Figure 5. Copy Number of Highly Variable Transport KCs in Bacteroides ovatus

The size and color of each square represent the copy number of each highly variable KC within each sample. Samples are grouped by host state (I, IBD; h, healthy; 
o, obese). The copy numbers of the 13 marker KCs in this genome cluster are illustrated for comparison. See also Figure S6. 



sporulation were all identified as set-specific variable KCs in the 
two genome clusters associated with Clostridium sp. Similarly, a 
set of sugar metabolism genes were all identified as set-specific 
variable KCs in Roseburia intestinalis, and a number of antibiotic 
resistance genes were identified as variable in multiple genome 
clusters, primarily those in the Firmicutes phylum. An enrichment 
analysis of functions associated with set-specific variable KCs in 
each cluster additionally revealed a number of important func- 
tions that were prone to copy-number variation (Table S6). For 
example, genes in the lipopolysaccharide biosynthesis pathway 
in Dialister invisus and Clostridium sp. were often observed with a
higher copy number in a small set of samples. Interestingly, vari- 
ation within functions related to sugar metabolism (i.e., KEGG 
pathways galactose metabolism, starch and sucrose meta- 
bolism, fructose and mannose metabolism, polyketide sugar 
unit biosynthesis) was observed primarily within Bacteroidetes 
clusters, whereas set-specific transport-related variation was 
almost absent from these clusters. Other functions enriched for 
set-specific variable KCs suggest transitions between virulent 
states, such as motility in butyrate-producing bacteria (NCBI 
accession FP929062), Eubacterium rectale, and Clostridium 
sp.; streptomycin biosynthesis in Acidaminococcus sp.; lysozyme
production in Bacteroides ovatus; the EHEC/EPEC pathogenicity
signature in Escherichia coli; and secretion systems
in butyrate-producing bacteria (NCBI accession FP929062), 
Clostridium sp., and Escherichia coli. Within Escherichia coli, 
type II secretion system genes were identified as set-specific 
decreased copy-number KCs, whereas type III secretion system 
genes were identified as set-specific increased copy-number 
KCs. Overall, much of the observed variation appeared to be 
associated with the way a species responds to and interacts 
with its surroundings, highlighting the strong adaptive potential 
of gut-associated bacteria. 

Clearly, different cohorts could harbor different sets of strains 
owing to an assortment of ecological or host-specific factors, 
and accordingly different genes may vary in copy number in 



different datasets. Notably, however, analysis of a second data- 
set of 73 gut samples from a Chinese cohort (Qin et al., 2012) 
yielded a marked overlap with our original Danish/Spanish 
cohort in both the set of KCs identified as variable and 
the set of functions enriched for copy-number variation 
(Extended Experimental Procedures). These findings suggest 
that, although variation may be personal, certain genes and func- 
tions (e.g., those related to environmental adaptation) may be 
universally prone to variation. 

Host State-Associated Variation

Although much of the variation across strains may reflect neutral 
processes or transitory dynamics, some variation may repre- 
sent adaptation to a specific host phenotype. To detect such 
potentially adaptive variation, we identified variable KCs in which 
the copy number in samples from obese or IBD subjects was 
significantly different than in samples from healthy subjects 
(Experimental Procedures). In total, we found 24 KCs whose 
copy number was significantly associated with IBD and three 
KCs whose copy number was significantly associated with 
obesity (FDR < 0.05; Table S7). 

Interestingly, a number of these KCs have been previously 
implicated in adverse host health states. For example, in our 
analysis, obesity was associated with a higher copy number of 
thioredoxin 1 (K03671) in Clostridium sp. (Figure 6A), and indeed 
thioredoxin reductase was recently shown to be enriched in the 
cecal metaproteome of mice fed a high-fat diet (Daniel et al., 
2014). Such results are consistent with thioredoxin’s regulatory 
role in maintaining redox equilibrium and the demonstrated 
links between a high-fat diet and oxidative stress in mammals 
(Furukawa et al., 2004). Additionally, in our analysis, the loss of 
a ubiquinone-reducing gene (K00349; nqrD) from Bacteroides 
plebeius was associated with obesity. A recent study in mice 
showed that supplemental ubiquinone reduced inflammation 
and metabolic stress accompanying a high-fat high-fructose 
diet by reducing the expression of certain genes associated 

Figure 6. Copy-Number Variation of Host State-Associated KCs

Two KCs whose copy number was significantly increased in samples from a specific host state are shown. The size and color of each square represent the copy 
number of the KC within each sample. 

(A) The copy number of thioredoxin 1 (K03671) in Clostridium sp. is significantly increased in samples from obese subjects. 

(B) The copy number of an MFS transporter gene (K08217) in the Roseburia inulinivorans genome cluster is significantly increased in samples from IBD subjects.
See also Table S7. 



with stress-response (Sohet et al., 2009), while mice not 
receiving the supplement gained more weight than their counter- 
parts. Importantly, however, ubiquinol, the reduced form of 
ubiquinone, has recently been shown to be the more readily ab- 
sorbed and more active form of the compound (Langsjoen and 
Langsjoen, 2014), raising the possibility that loss of microbial 
ubiquinone-reducing capabilities from certain species may hind- 
er the effectiveness and protective capacity of ubiquinone in the 
host. Other findings shed new light on the role of individual spe- 
cies in disease, with evidence of variation associated with com- 
mon disease hallmarks, such as pathogenicity-related secretion 
and antibiotic resistance. In Roseburia inulinivorans (Figure 6B), 
increased copy number of a gene (K08217) coding for a major 
drug efflux protein known to play a role in antibiotic resistance 
was highly associated with IBD-afflicted individuals. Similarly, 
HlyD (K01993), an essential component of RTX hemolytic toxin
secretion (Pimenta et al., 2005), exhibited increased copy num- 
ber in IBD samples in Bacteroides uniformis. See Table S7 for 
a full list of disease-associated KCs. Interestingly, none of the 
obesity-associated KCs and only 3 of the 24 IBD-associated 
KCs were found to vary significantly in the Chinese cohort 
described above, among whom only one individual was obese 
and none were reported as having IBD. 



Strain-Level Deconvolution of Microbiome Composition 
and Intra-Species Population Structure 

Clearly, the microbiomes of different individuals can house 
multiple strains of the same species with potentially different 
relative abundances. Our copy-number estimates for each clus- 
ter accordingly represent average copy numbers across the 
different strains in the sample. Next, we therefore examined 
whether these estimates can be used to obtain insights into 
strain-level population structure, going beyond species-level 
composition assays and focusing specifically on the composi- 
tion of strains within each genome cluster rather than on the 
abundance of the cluster itself. 

First, we explored how well the copy-number profiles obtained 
for each genome cluster in each sample can be explained by 
known reference strains, using a regression analysis to decon- 
volve these copy-number profiles into a linear combination of 
the strains included in our database (Experimental Procedures). 
Obviously, these strains may not encompass the full set of 
strains present in the samples analyzed, yet such an analysis 
may be useful in examining what portion of the observed varia- 
tion can be accounted for by known strains and what portion 
represents potentially novel variation. Indeed, we found that, in 
well-characterized clusters with many sequenced genomes, 

Figure 7. Predicted Strain-Level Population Structure within Streptococcus thermophilus

(A) A linear regression analysis was used to model the copy-number profile obtained for cluster 110 (Streptococcus thermophilus) in each sample as a combination
of known reference genomes, with prediction weights shown as stacked colored bars. Prediction accuracy (R²) is indicated above each bar. Samples
with low or negative R² values potentially contain variation that cannot be explained by any combination of known reference genomes.

(B) A principal coordinate analysis depicting the differences between the copy-number profiles obtained for this genome cluster in the various samples (open 
circles), as well as the copy-number profiles of reference genomes (filled circles). 

See also Figure S7. 



the copy-number profiles of most samples could be well ex- 
plained by a linear combination of known strains. For example, 
in the Escherichia coli cluster that comprised 63 sequenced 
genomes in our database, 76% of the variation in copy number 
could be explained on average by these genomes (R² = 0.76 ±
0.12). In this cluster, the inferred representation of each strain
differed widely across samples, with some strains (e.g., Escherichia
coli O111:H- str. 11128) highly represented across multiple
samples and others found in just one sample. However, for less 
well-characterized clusters with only a few known strains in our 
database, in some cases just a subset of the observed copy- 
number variation could be explained. For example, the four 
known strains of Streptococcus thermophilus could be used to 
explain a majority of the variation observed in some of the samples
(R² > 0.5) yet failed to explain the variation observed in four
of the samples (R² < 0), suggesting the existence of potentially
novel, yet-to-be-sequenced variation (Figure 7A). 

To further compare copy-number variation profiles across 
samples and to examine variation that may not be captured by 
known strains (including, notably, in clusters comprising only
one known strain), we used a principal coordinate analysis. 
This analysis revealed a complex population structure within 
each cluster, with marked differences among samples indi- 
cating the prevalence of personalized variation. For a number 
of genome clusters, however, samples appear to group into 
distinct sets, potentially reflecting individuals with similar 
intra-species population structures (Figure 7B). Moreover, by 
including the reference genomes in this principal coordinate 
analysis, we were able to distinguish previously captured 



variation versus novel variation observed across samples. For 
example, the principal coordinate plot for the Streptococcus 
thermophilus genome cluster (Figure 7B) clearly demonstrates 
that, although the copy-number profiles of most samples clus- 
tered tightly with several known reference genomes, the four 
poorly explained samples mentioned above clustered together 
and contained variation that was distinct from any reference 
genome. Such a pattern may indicate the presence of novel 
shared strains, providing a promising basis for targeted seq- 
uencing. Similar patterns were also observed in other clusters, 
in which a distinct, tightly clustered subset of samples or individual
samples exhibit a markedly different copy-number profile from
that of any sequenced genome (Figures S7A and S7B). Overall 
though, each genome cluster exhibited a unique population 
structure across individuals, highlighting the complex suite of 
forces governing taxonomic composition in the gut (Levy and 
Borenstein, 2013). 

DISCUSSION 

By and large, closely related organisms tend to encode similar 
sets of genes. This consistency is in fact often used to infer func- 
tional capacity from taxonomy (Langille et al., 2013; Zaneveld 
et al., 2010). Clearly, however, this relationship between phylog- 
eny and gene content is imperfect, and each species represents 
a large collection of strains that differ in the set of genes they 
encode, the copy number of these genes, and ultimately, their 
functional capacity. Above, we have focused on identifying in- 
stances in which this relationship between microbial species 



Cell 160, 583-594, February 12, 2015 ©2015 Elsevier Inc. 591 









and genes breaks, presenting a large-scale analysis of copy- 
number variation in a diverse array of gut species. Our analysis 
has demonstrated that copy-number variation is prevalent in 
the gut environment, with some species exhibiting significant 
copy-number variation in >20% of their genes. Such variation 
may induce significant microbiome-wide shifts and may account 
for at least some of the observed discrepancies between trends 
observed at the species level versus trends measured at the
gene level. Moreover, intra-species variation was shown to be 
especially prevalent in genes involved in specific functions, 
most notably functions that impact the way an organism inter- 
acts with its environment such as transport and signaling pro- 
cesses. This may suggest an adaptive dynamic by which certain 
species respond to changes in community composition or in the 
gut niche and a potentially crucial role of the gut environment 
in shaping bacterial evolution (Levy and Borenstein, 2013; 
Shapiro et al., 2012). Other highly variable functions, such as 
lipopolysaccharide biosynthesis, cell motility, and secretion sys- 
tems, may represent changes in virulence as organisms respond 
to host immune responses. Interestingly, many of these same 
functions were highlighted in a previous study as more difficult 
to accurately correlate with 16S data (Langille et al., 2013). Our
analysis further identified variable functions that may correlate 
with host states, exhibiting differential copy number in specific 
genomes. It remains unclear, however, whether such host 
state-associated variation is a cause or an effect. Our framework 
additionally facilitated the inference of intra-species population 
profiles for each individual, suggesting that most individuals har- 
bor multiple strains of each species. 

Though still far from an exhaustive catalog of strains that may 
be present across all human gut microbiomes, the framework 
presented above represents the most comprehensive account 
of copy-number variation in the human gut microbiome to 
date. It is our hope that this framework and the results presented 
here will inform future studies of strain-level microbiome compo- 
sition, demonstrating the extent of functional information that is 
lost by limiting characterization to the level of species and 
prompting further investigation and sequencing of strain-level 
features. Yet, there are clearly a number of caveats that should 
be considered in designing such future efforts. First, our analysis 
is limited to the detection of variation in gut species for which at 
least one fully sequenced genome is available, and future studies 
may benefit from additional genomes. Notably though, we did 
not detect significantly more variation in clusters for which 
more reference genomes were available. In addition, our pipe- 
line was designed to detect gene losses or amplifications but 
cannot identify gain of genes that are not present in any of the 
reference genomes included in the genome cluster. Such gain 
or transfer events may represent an additional substantial source 
of intra-species variation (Smillie et al., 2011). Our framework
could, however, further facilitate future efforts to study sequence 
divergence among duplicated genes, informing our view of neo- 
functionalization and conservation processes in the microbiome. 
Notably, in our analysis, we focused on detecting high-confi- 
dence instances of variation, applying conservative parameters 
for read alignment and for variability calling. Specifically, we limit 
our analysis to “detectable” genome clusters, defined as those 
with >1× coverage in the sample. Our analysis of a synthetic dataset
confirmed that, in such clusters, copy-number estimates
can be inferred with 96% accuracy but that prediction accuracy 
dropped significantly in genome clusters with lower coverage 
(Figure S4B and Extended Experimental Procedures). With 13 
million reads per sample (the lowest sequencing depth in the 
cohort analyzed), species that comprise >0.4% of the sample 
are likely to be considered detectable by our pipeline (while a 
higher sequencing depth of a sample will clearly allow analysis 
of even rarer species). Future studies may relax some of these 
parameters or incorporate additional information (e.g., gene 
conservation) to detect more subtle variation. Finally, as with 
most studies relating microbiome composition to function, our 
analysis relies on the availability of functional gene databases, 
which may contain incomplete or erroneous annotations. By 
considering variation across samples rather than variation from 
reference genomes, our analysis is largely robust to such anno- 
tation inaccuracies. Interestingly, however, variable KCs identi- 
fied by our analysis were much more likely to lack a functional 
annotation than non-variable KCs, suggesting that much of the 
detected variation in gene content has as yet uncharacterized 
consequences. Combined, these results highlight both the 
need for additional genome sequences and the importance of 
continued efforts for characterizing gene function. 

Ultimately, analysis of intra-species variation in microbial com- 
munities is crucial for understanding the complex relationship 
between species composition and community-level functional 
capacity. Our analysis, quantifiably characterizing such variation 
in the gut microbiome, is an important first step in this direction, 
and the resulting dataset provides an essential resource for 
future predictive studies. 

EXPERIMENTAL PROCEDURES 

Metagenomic Samples and Reference Genomes 

Gut metagenomic data for 109 Danish and Spanish individuals, including individuals
afflicted with obesity or IBD, were obtained from Qin et al. (2010). A list
of 261 dominant and prevalent human gut microbial strains, grouped into 101
genome clusters (Table S1) based on sequence similarity of 40 marker genes,
was obtained from Schloissnig et al. (2013). Nucleotide contig sequences,
gene calls, and amino acid protein sequences were downloaded for each 
genome, and protein sequences were annotated with KEGG orthologous 
groups (KOs). See Extended Experimental Procedures for more details. 

Calculation of Copy-Number Estimates 

Shotgun metagenomic reads were aligned to the set of reference genomes 
with BWA, using parameters and filters carefully validated by extensive simulation
analyses (Figures S1 and S2; Extended Experimental Procedures). In
total, 2,469,102,286 reads were mapped. Average coverage over each gene 
region was determined using samtools (Li et al., 2009), and the coverage of 
each KC (KO-cluster pair) was obtained by summing over all genes annotated 
with the same KO and genome cluster. KC coverage was normalized by cluster 
abundance, defined as the average coverage over a set of 13 universal marker
KOs (Figure S3B; Extended Experimental Procedures), to obtain the estimated
copy number V_kcs of each KO k, in each cluster c, and in each sample s.
“Detectable KCs” in a sample were defined as those with V_kcs ≥ 0.5. “Detectable
clusters” within each sample were defined as those with at least 12
detectable marker KCs and average marker coverage >1. KCs that were 
not detectable in any sample were removed from the analysis. 
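
The normalization and detectability rules above can be sketched as follows. This is a minimal illustration under stated assumptions, not the pipeline used in the study: the function names, the dictionary layout, and the placeholder marker identifiers are hypothetical, and the input is assumed to be the already-summed coverage of each KO within one genome cluster in one sample.

```python
from statistics import mean


def estimate_copy_numbers(kc_coverage, marker_kos):
    """Normalize KC coverage by cluster abundance (one cluster, one sample).

    kc_coverage: dict mapping a KO id to its summed gene coverage.
    marker_kos:  ids of the universal marker KOs (13 in the text).
    Returns (cluster_abundance, {ko: estimated copy number}).
    """
    # Cluster abundance = average coverage over the marker KOs.
    abundance = mean([kc_coverage.get(ko, 0.0) for ko in marker_kos])
    if abundance <= 0:
        return abundance, {}
    return abundance, {ko: cov / abundance for ko, cov in kc_coverage.items()}


def is_detectable_cluster(abundance, copy_numbers, marker_kos,
                          min_markers=12, min_coverage=1.0, min_copy=0.5):
    """Apply the stated rule: >= 12 detectable markers and marker coverage > 1."""
    detectable = sum(1 for ko in marker_kos
                     if copy_numbers.get(ko, 0.0) >= min_copy)
    return abundance > min_coverage and detectable >= min_markers


# Hypothetical example: 13 placeholder markers at 4x and one KC at 8x coverage.
markers = [f"marker_{i}" for i in range(13)]
coverage = {ko: 4.0 for ko in markers}
coverage["K02013"] = 8.0
abundance, copies = estimate_copy_numbers(coverage, markers)
```

Dividing by the marker-based abundance expresses each KC relative to genes expected in a single copy per genome, so a value near 2 suggests roughly two copies per cell averaged over the strains present.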

Detection of Highly Variable and Set-Specific Variable KCs 

For each of the 40,088 KCs present in clusters detectable in at least ten sam- 
ples, the median copy number (baseline) across samples and the MAD 
(median absolute deviation) from this baseline were calculated. KCs with a
MAD more than 2 SDs from the MAD distribution mean (MAD > 0.6346) 
were considered highly variable. KCs in which at least 10% of samples had 
a copy number that exceeded the baseline by this threshold were considered 
set-specific increased variable KCs. Set-specific decreased KCs were similarly 
defined as KCs in which at least 10% of samples had a copy number that fell 
below the baseline by this threshold. 
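
A minimal sketch of these variability calls for a single KC is given below. It assumes that "this threshold" refers to the MAD cutoff of 0.6346 quoted above and that the cutoff has already been derived from the full MAD distribution; the function names and the returned dictionary are hypothetical.

```python
from statistics import median


def baseline_and_mad(copy_numbers):
    """Return the median copy number (baseline) and the MAD around it."""
    baseline = median(copy_numbers)
    mad = median([abs(v - baseline) for v in copy_numbers])
    return baseline, mad


def classify_kc(copy_numbers, threshold=0.6346, min_fraction=0.10):
    """Flag a KC as highly variable and/or set-specific variable.

    threshold:    assumed to be the MAD cutoff reported in the text.
    min_fraction: minimum share of samples deviating from the baseline.
    """
    baseline, mad = baseline_and_mad(copy_numbers)
    n = len(copy_numbers)
    n_up = sum(1 for v in copy_numbers if v > baseline + threshold)
    n_down = sum(1 for v in copy_numbers if v < baseline - threshold)
    return {
        "baseline": baseline,
        "mad": mad,
        "highly_variable": mad > threshold,
        "set_specific_increased": n_up / n >= min_fraction,
        "set_specific_decreased": n_down / n >= min_fraction,
    }


# Hypothetical example: one of ten samples carries extra copies.
result = classify_kc([1.0] * 9 + [3.0])
```

In this example the MAD is zero, so the KC would not be called highly variable, yet it would still qualify as set-specific increased because 10% of the samples exceed the baseline by more than the threshold. This illustrates why the two definitions can disagree.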

Detection of Host State-Associated KCs 

A KC was defined as obesity associated if the copy numbers in samples from 
obese individuals were significantly higher or significantly lower than the copy 
numbers in samples from non-obese individuals, according to a two-sample 
t test (FDR-corrected p < 0.05). IBD-associated KCs were similarly defined. 
Samples that were labeled as both obese and IBD were omitted from this 
analysis. 
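
The multiple-testing step can be illustrated with the sketch below. The text states only that p values were FDR corrected; the Benjamini-Hochberg procedure used here is an assumption, as are the function names. The per-KC p values are taken as precomputed inputs (for example, from a two-sample t test run in a statistics package).

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def host_state_associated(p_values, alpha=0.05):
    """Indices of KCs whose adjusted p value falls below alpha."""
    adjusted = benjamini_hochberg(p_values)
    return [i for i, q in enumerate(adjusted) if q < alpha]


# Hypothetical p values for four KCs.
hits = host_state_associated([0.01, 0.04, 0.03, 0.5])
```

With these hypothetical inputs, only the first KC stays below 0.05 after adjustment, even though three of the raw p values were below that level.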

Copy-Number Profile Deconvolution and Principal Coordinate 
Analysis 

For each sample, a non-negative least-squares linear regression analysis was 
performed to obtain the linear combination of reference genomes in each 
multi-genome cluster, optimally explaining the copy-number estimates of 
variable KCs. The regression was constrained such that the sum of genome 
weights for each sample and cluster equaled one. Prediction accuracy was defined
as the R² value for each sample. A principal coordinate analysis was also performed
for every genome cluster, operating on the pairwise Euclidean distance
matrix of set-specific variable KC copy numbers in each sample and each 
sequenced reference genome. 
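
The constrained deconvolution can be sketched as follows. The solver used in the study is not specified here, so this example substitutes projected gradient descent onto the probability simplex, which enforces non-negative weights that sum to one. All function names are hypothetical, the reference matrix is assumed to contain at least one non-zero entry, and the sample profile is assumed not to be constant (otherwise R² is undefined).

```python
def project_to_simplex(v):
    """Euclidean projection of v onto {w : w >= 0, sum(w) = 1}."""
    u = sorted(v, reverse=True)
    cumulative, theta = 0.0, 0.0
    for j, uj in enumerate(u, start=1):
        cumulative += uj
        t = (cumulative - 1.0) / j
        if uj - t > 0:
            theta = t
    return [max(x - theta, 0.0) for x in v]


def predict(weights, references):
    """Weighted mix of reference profiles; references[g][k] is KC k in genome g."""
    n_kcs = len(references[0])
    return [sum(w * ref[k] for w, ref in zip(weights, references))
            for k in range(n_kcs)]


def deconvolve(profile, references, iterations=2000):
    """Least-squares genome weights under non-negativity and sum-to-one."""
    n_genomes = len(references)
    # Step size 1/L, with L bounded by twice the squared Frobenius norm.
    step = 1.0 / (2.0 * sum(x * x for ref in references for x in ref))
    weights = [1.0 / n_genomes] * n_genomes
    for _ in range(iterations):
        residual = [p - y for p, y in zip(predict(weights, references), profile)]
        gradient = [2.0 * sum(r * e for r, e in zip(ref, residual))
                    for ref in references]
        weights = project_to_simplex(
            [w - step * g for w, g in zip(weights, gradient)])
    return weights


def r_squared(profile, predicted):
    """Coefficient of determination; negative when the fit is worse than the mean."""
    mean_y = sum(profile) / len(profile)
    ss_tot = sum((y - mean_y) ** 2 for y in profile)
    ss_res = sum((y - p) ** 2 for y, p in zip(profile, predicted))
    return 1.0 - ss_res / ss_tot


# Hypothetical example: a sample that is a 70/30 mix of two reference genomes.
refs = [[1.0, 0.0, 2.0, 1.0], [0.0, 1.0, 1.0, 3.0]]
sample = [0.7, 0.3, 1.7, 1.6]
w = deconvolve(sample, refs)
fit = r_squared(sample, predict(w, refs))
```

Because the weights are constrained to the simplex, a sample whose profile lies outside the span of the known strains can yield an R² below zero, in line with the negative values reported for poorly explained samples.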

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, seven 
figures, and seven tables and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2014.12.038.

SUMMARY

Functional micropeptides can be concealed within 
RNAs that appear to be noncoding. We discovered 
a conserved micropeptide, which we named myore- 
gulin (MLN), encoded by a skeletal muscle-specific 
RNA annotated as a putative long noncoding RNA. 
MLN shares structural and functional similarity with 
phospholamban (PLN) and sarcolipin (SLN), which 
inhibit SERCA, the membrane pump that controls 
muscle relaxation by regulating Ca2+ uptake into the
sarcoplasmic reticulum (SR). MLN interacts directly
with SERCA and impedes Ca2+ uptake into the SR.
In contrast to PLN and SLN, which are expressed in 
cardiac and slow skeletal muscle in mice, MLN is 
robustly expressed in all skeletal muscle. Genetic 
deletion of MLN in mice enhances Ca2+ handling in
skeletal muscle and improves exercise performance. 
These findings identify MLN as an important regulator 
of skeletal muscle physiology and highlight the possi- 
bility that additional micropeptides are encoded in 
the many RNAs currently annotated as noncoding. 

INTRODUCTION 

Ca2+ controls the normal function of striated muscle by acting as
the primary regulator of the sarcomeric contractile machinery and
as a second messenger in the signal transduction pathways that
control muscle growth, metabolism, and pathological remodeling
(Bassel-Duby and Olson, 2006; Berchtold et al., 2000). Ca2+
handling in striated muscle is tightly regulated by Ca2+ pumps
in the sarcoplasmic reticulum (SR) and plasma membranes that
maintain intracellular Ca2+ levels ~10,000-fold lower than extracellular
and SR concentrations (Berridge et al., 2003; Rossi and
Dirksen, 2006). Upon muscle stimulation, Ca2+ release by the
ryanodine receptor (RyR) in the SR membrane transiently increases
Ca2+ levels in the cytosol, triggering actomyosin cross-bridge
formation within the sarcomere to generate contractile
force. Reuptake of Ca2+ into the SR by sarcoplasmic reticulum
Ca2+-ATPase (SERCA) is necessary for muscle relaxation and
restores SR Ca2+ levels for subsequent contraction-relaxation
cycles. SERCA serves as a central regulator of striated muscle
performance and the pathological signaling pathways that drive
cardiovascular and skeletal muscle disease (Dorn and Molkentin,
2004; Goonasekera et al., 2011; Odermatt et al., 1996; Pan et al.,
2003; Periasamy and Kalyanasundaram, 2007).

Two related peptides, phospholamban (PLN) and sarcolipin
(SLN), directly interact with SERCA in the SR membrane to regulate
Ca2+ pump activity (Kranias and Hajjar, 2012; MacLennan
and Kranias, 2003). PLN and SLN are expressed in partially overlapping
patterns in cardiac and slow skeletal muscle and are
important regulators of muscle performance and cardiovascular
disease (Briggs et al., 1992; Kranias and Hajjar, 2012; Minamisawa
et al., 2003; Schmitt et al., 2003; Tada and Toyofuku,
1998; Tupling et al., 2011). PLN-deficient mice exhibit enhanced
myocardial contractile performance, characterized by increased
ventricular relaxation rates and SERCA pump activity (Chu et al.,
1998; Luo et al., 1994). Similarly, loss of PLN or SLN expression
significantly increases the rate of muscle relaxation and SERCA
pump activity in slow skeletal muscle but does not affect fast skeletal
muscles, which do not express PLN or SLN (Slack et al., 1997;
Tupling et al., 2011; Vangheluwe et al., 2005). The absence of PLN
and SLN expression in fast skeletal muscle, the dominant muscle
type in mice, suggests that an unidentified factor regulates Ca2+
handling and the contractile performance of this tissue.

Recent genome-wide studies have suggested that hundreds
of functional micropeptides may be encoded in vertebrate long
noncoding RNAs (lncRNAs) (Andrews and Rothnagel, 2014; Bazzini
et al., 2014). The microproteome has largely been overlooked
in gene annotations, primarily because of the difficulty
in identifying functional small open reading frames (ORFs) in
RNA transcripts. While analyzing an annotated skeletal
muscle-specific lncRNA, we discovered a previously unrecognized
ORF encoding a conserved 46 amino acid micropeptide,
which we named myoregulin (MLN). MLN forms a single transmembrane
alpha helix that interacts with SERCA in the membrane
of the SR and regulates Ca2+ handling. Consistent with
this function, deletion of MLN in mice significantly enhances
Ca2+ handling and improves exercise performance. These
findings identify MLN as the predominant SERCA-inhibitory

Figure 1. Discovery of a Skeletal Muscle-Specific Micropeptide

(A) A short ORF encoding a conserved micro- 
peptide, which we named myoregulin (MLN), is 
contained within exon 3 of an annotated lncRNA in
human and mouse genomes. The position of the 
MLN ORF is indicated in red. 

(B) In situ hybridization showing skeletal muscle-specific
expression of MLN at the indicated embryonic
time points. hrt, heart; in, intestine; liv,
liver; lu, lung; m, myotome; to, tongue.

(C) Northern blot of RNA isolated from adult mouse 
tissues using a probe specific to the full-length 
MLN transcript shows skeletal muscle-specific 
expression. 

(D) Diagram of the constructs used for in vitro 
translation of the MLN micropeptide. The full-length 
MLN RNA transcript was subcloned into the CS2 
vector containing the SP6 phage RNA polymerase 
promoter (SP6-MLN). A frameshift mutation was 
introduced immediately after the endogenous ATG 
to disrupt the MLN ORF (SP6-MLN-FS). 

(E) Coupled in vitro transcription and translation 
reactions of the SP6-MLN vector using radiolabeled
35S-methionine produced a ~5 kDa micropeptide,
visualized by Tricine SDS-PAGE. The
frameshift mutation in the MLN ORF (SP6-MLN- 
FS) abolished any detectable expression. 

(F) Targeting strategy using CRISPR/Cas9-mediated
homologous recombination to knock in a
FLAG epitope tag into the MLN locus in C2C12
cells. PCR-based genotyping using primers (P1-P3)
or RFLP analysis of PCR products generated
using primers (P4 and P5) was used to verify
correct targeting.



(G) RFLP analysis of WT C2C12 and heterozygous C2C12 myoblasts for the MLN-FLAG knockin allele. 

(H) Western blot analysis showing endogenous expression of the MLN-FLAG fusion peptide in differentiated C2C12 myotubes, detected with an anti-FLAG 
antibody. 

See also Figure S1.



micropeptide in skeletal muscle, which surprisingly was con- 
cealed in an RNA annotated as noncoding. 

RESULTS 

Discovery of a Conserved Micropeptide Encoded by a 
lncRNA

In a bioinformatic screen for uncharacterized skeletal muscle-specific
genes, we identified a vertebrate RNA transcript annotated
as a lncRNA (LINC00948 in humans and 2310015B20Rik
in mice). Analysis of the evolutionary conservation of these transcripts
identified a short 138 nucleotide ORF with the potential to
encode a highly conserved 46 amino acid micropeptide, which
we named myoregulin (MLN) (Figure S1A available online). The
human and mouse MLN genes consist of three exons that
span 16.5 and 15.0 kb, respectively, with the ORF located in
exon 3 (Figure 1A). Nucleotide insertions and deletions that could
alter the reading frame flank the MLN ORF, demonstrating that
these sequences comprise UTRs (Figure S1A).

During embryogenesis, MLN is expressed in the myotomal
compartment of the somites, the anlagen of skeletal muscle (Figure 1B).
During fetal and adult stages, MLN is robustly expressed
in all skeletal muscles and is not detectable in cardiac or smooth
muscles (Figures 1B and 1C). MLN transcripts are also present in
C2C12 myoblasts and myotubes, but not in 10T1/2 fibroblasts
(Figure S1B).

To determine if the MLN ORF is translated as a micropeptide,
we transcribed and translated the full-length MLN RNA in vitro
in the presence of radiolabeled 35S-methionine (Figures 1D
and 1E). A single ~5 kDa micropeptide was produced from the
MLN transcript, whereas a frameshift mutation that disrupted
the MLN ORF abolished any detectable expression (Figures 1D
and 1E). We further cloned a FLAG epitope tag in-frame with
the C terminus of the MLN coding sequence within the full-length
MLN transcript (Figure S1C). Expression of this construct in
COS7 cells yielded a peptide of ~6 kDa, corresponding to the
predicted molecular weight of the MLN-FLAG fusion peptide,
detected by western blot (Figure S1D).
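
The peptide sizes reported above can be checked with simple arithmetic on the ORF length. The following is a minimal sketch, not part of the study: it assumes the 138 nucleotides exclude the stop codon, uses a rule-of-thumb average of about 110 Da per amino acid residue, and assumes the tag is the standard eight-residue FLAG epitope (DYKDDDDK). The function names are hypothetical.

```python
# Rule-of-thumb average residue mass; an assumption, not a measured value.
AVG_RESIDUE_MASS_DA = 110.0
# Standard FLAG epitope; assumed here to be the tag used in the construct.
FLAG_TAG = "DYKDDDDK"


def peptide_length(orf_nt: int) -> int:
    """Number of residues encoded by an ORF given without its stop codon."""
    if orf_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_nt // 3


def approx_mass_kda(n_residues: int) -> float:
    """Approximate peptide mass in kDa from the residue count."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


mln_residues = peptide_length(138)
mln_kda = approx_mass_kda(mln_residues)
mln_flag_kda = approx_mass_kda(mln_residues + len(FLAG_TAG))

print(mln_residues, round(mln_kda, 2), round(mln_flag_kda, 2))
```

Under these assumptions the estimate is 46 residues at roughly 5.1 kDa for MLN and roughly 5.9 kDa for MLN-FLAG, in line with the ~5 kDa and ~6 kDa bands described in the text.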

To determine whether MLN is endogenously expressed in
skeletal muscle, we introduced the same FLAG epitope tag
into the MLN locus in C2C12 muscle cells using CRISPR/
Cas9-mediated homologous recombination (Figures 1F and
S1E). PCR-based genotyping and restriction fragment-length
polymorphism (RFLP) analyses were used to verify correct tar-
geting (Figures 1G and S1F). As shown by western blot analysis
in Figure 1H, C2C12 cells heterozygous for the MLN-FLAG 
Figure 2. MLN Forms a Transmembrane
Alpha Helix that Interacts with SERCA 

(A) The secondary structure (SS) of MLN is pre- 
dicted to contain an N-terminal beta sheet and 
C-terminal transmembrane alpha helix (E, beta 
sheet; H, alpha helix). Alignment of mouse MLN, 
PLN, SLN, and the invertebrate ortholog SCL 
identified identical (green) and similar (blue) resi- 
dues conserved in all four micropeptides. 

(B) The structure of the transmembrane helices 
of MLN, SLN, PLN, and SCL was modeled using 
I-TASSER and showed a common spatial orien-
tation of conserved residues.

(C) Automated protein docking using ClusPro 
predicted that MLN occupies the same groove in 
SERCA1 that is recognized by SLN. 

(D) Two-photon laser scanning confocal micro- 
scopy showing colocalization of GFP-MLN and 
mCherry-SERCA1 fusion proteins in the SR of
mature skeletal muscle fibers. M, M-line; Z, Z-line. 

(E) Retroviral expression of an N-terminal HA- 
tagged MLN fusion peptide (HA-MLN) in C2C12 
myoblasts was enriched in the subcellular fraction 
containing SR/ER membrane proteins. Enrich- 
ment for cytosolic, SR/ER membrane, and plasma 
membrane proteins was verified by western 
blot analysis for Hsp90, PDI, and N-cadherin, 
respectively. 

(F) Coimmunoprecipitation (coIP) experiments
with HA-tagged MLN alanine mutants and a Myc- 
tagged SERCA1 construct transfected into COS7 
cells identified residues important for interaction 
of MLN with SERCA1. Interaction of MLN with 
SERCA1 was abolished by mutation of residues 
shared with PLN, SLN, and SCL (L29A, F30A, and 
F33A), but not by the charged residues K27A or 
D35A. Western blot performed with anti-HA (WB- 
HA) or anti-Myc (WB-Myc). 

See also Figure S2. 
knockin allele expressed a micropeptide of the predicted molec-
ular weight of the MLN-FLAG fusion peptide. Together, these
data reveal that the MLN RNA, which is annotated as noncoding, 
in fact encodes a cryptic micropeptide. 

MLN Encodes a Transmembrane Alpha Helix 

The MLN micropeptide is highly conserved across mammals 
and is predicted to form a type II single-pass transmembrane 



alpha helix in which the 19 N-terminal 
amino acid residues are exposed to 
the cytosol and the three C-terminal res- 
idues are lumenal (Figures 2A and S2A). 
The N-terminal residues (5-10) of MLN 
are predicted to fold into a small beta 
sheet, and the C-terminal half of the 
protein contains the predicted trans- 
membrane alpha helix (Figures 2A and 
S2A). Circular dichroism spectroscopy 
measurements of synthesized full-length 
MLN peptide in detergent micelles 
confirmed it consists of alpha helix 
(43.6%), beta sheet (21.1%), turn (9.6%), and random coil 
(25.7%) structures (Figure S2B). 

Association of MLN with SERCA 

MLN shows a strong structural resemblance to PLN and SLN, 
which also form type II single-pass transmembrane peptides ori- 
ented with their N-terminal residues exposed to the cytosol. 
Sequence alignment of MLN, PLN, and SLN revealed a number 
of identically conserved residues in their transmembrane re-
gions, which are also found in the invertebrate ortholog, sarco-
lamban (SCL) (Figure 2A) (Magny et al., 2013). Structural
modeling of the transmembrane helices showed that all four 
peptides are predicted to form alpha helical structures with res- 
idues arranged in a similar spatial pattern (Figure 2B). PLN and 
SLN both interact with SERCA in the membrane of the SR in a 
common groove formed by the M2, M6, and M9 helices of 
SERCA (Figure 2C) (Toyoshima et al., 2003, 2013; Winther 
et al., 2013). Automated protein docking of the transmembrane 
model of MLN with the crystal structure of SERCA revealed 
that MLN aligned in the same groove that is occupied by PLN 
and SLN (Figure 2C). 



Figure 3. MLN Regulates SR Ca2+ Levels by
Inhibiting SERCA Pump Activity

(A and B) The Ca2+ dependence of the relative rate
of Ca2+ uptake is shown for homogenates from
HEK293 cells cotransfected with SERCA1 and the
indicated constructs. Cotransfection with MLN,
PLN, or SLN resulted in a similar decrease in Ca2+
uptake, corresponding to a decreased affinity
of SERCA for Ca2+, relative to empty vector
(Control). For comparison, untransfected cells
and SERCA1-expressing cells treated with the
SERCA inhibitor thapsigargin (100 nM) are shown.
The activity of the full-length RNA transcript en-
coding the MLN ORF (MLN RNA) is abolished
by a frameshift mutation in the MLN ORF (MLN-
RNA FS).

(C) Retroviral cotransduction of C2C12 myoblasts
with the FRET-based Ca2+ sensor T1ER with MLN
or SLN was used to directly measure the relative
levels of SR Ca2+. Both MLN and SLN significantly
decreased SR Ca2+ levels relative to an empty
retroviral vector.

(D) Retroviral overexpression of MLN or SLN in
C2C12 myoblasts treated with 4-CMC and
imaged with fura-2-AM showed decreased levels
of SR Ca2+, measured by peak Ca2+ release from
the SR.

Data are presented as mean ± SEM (*p < 0.05
compared to pBx-empty). See also Table S1.



A band and overlapped with the localiza-
tion of an mCherry-SERCA1 fusion pro-
tein within the SR (Figure 2D). This pattern
was consistent with the localization of 
GFP-PLN and GFP-SLN fusion proteins 
(Figure S2C). Additionally, an N-terminal 
HA-tagged MLN fusion protein (HA-MLN) expressed in C2C12 
myoblasts was enriched in the subcellular fraction containing 
SR/ER membrane proteins (Figure 2E), altogether suggesting 
that MLN functions in the membrane of the SR. 

In coimmunoprecipitation experiments, the HA-MLN fusion 
protein formed a stable complex with SERCA1 (skeletal muscle 
specific), SERCA2a (cardiac and slow skeletal muscle specific), 
and SERCA2b (ubiquitous) isoforms (Figure S2D). Alanine muta- 
genesis of MLN residues shared with PLN and SLN (L29A, F30A, 
and F33A) abolished the ability of MLN to interact with SERCA1,
whereas mutation of charged residues (K27A, D35A) did not alter 
this interaction (Figure 2F). These findings suggest that MLN, 
PLN, and SLN share a common hydrophobic binding motif that 
stabilizes their association with SERCA. 







MLN Is Embedded in the SR Membrane and Colocalizes 
with SERCA1

To examine the subcellular localization of MLN in vivo, we elec- 
troporated a plasmid encoding GFP fused to MLN (GFP-MLN) 
into the flexor digitorum brevis muscle of adult mice. Two- 
photon laser scanning confocal microscopy allowed for the 
simultaneous detection of GFP fluorescence and second har- 
monic generation (SHG) to visualize the myosin A band of the 
sarcomeres (Nelson et al., 2013). The GFP-MLN fusion protein 
localized in a repeating pattern that alternated with the myosin 



MLN Regulates Ca2+ Handling by Inhibiting SERCA Pump
Activity

PLN and SLN both function to inhibit Ca2+ reuptake into the
SR by lowering the affinity of SERCA for Ca2+, without altering
the maximal rate of Ca2+ pump activity (Vmax). To determine
if MLN regulates SERCA activity in a similar manner, we
directly measured Ca2+-dependent Ca2+-ATPase activity in
homogenates from HEK293 cells expressing SERCA1 (Fig-
ure 3A). Similar to the effects of PLN and SLN, expression of



598 Cell 160, 595-606, February 12, 2015 ©2015 Elsevier Inc.









Figure 4. Developmental and Adult Expres- 
sion of MLN, PLN, and SLN in the Mouse 

(A and B) Real-time PCR showing the relative 
expression of MLN, PLN, SLN, and SERCA iso- 
forms across multiple skeletal muscles, cardiac, 
and nonmuscle tissues isolated and pooled from 
three adult 8-week-old C57BL/6 male mice. See
also Figure S3. 
MLN caused a significant reduction in the rate of Ca2+ uptake,
measured as an increase in KCa (Figure 3A; Table S1). Coex-
pression of the full-length RNA encoding MLN (MLN RNA) re-
sulted in a similar decrease in Ca2+ uptake compared to a vec-
tor containing only the MLN coding sequence (Figure 3B). The
activity of the MLN RNA was abolished by introduction of a
frameshift mutation to disrupt the expression of the MLN micro-
peptide (MLN-RNA-FS), demonstrating that the RNA itself does
not inhibit SERCA activity (Figure 3B). No effects on Vmax were
observed under any conditions tested, and the addition of
thapsigargin, a potent inhibitor of SERCA activity, abolished
Ca2+ uptake (Figure 3A).
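
The distinction drawn above, a shift in KCa with no change in Vmax, can be illustrated with a Hill-type rate function, which is a common way to describe the Ca2+ dependence of uptake. This is a sketch only: the text here does not state which function was fitted, and the KCa values and Hill coefficient below are hypothetical placeholders rather than values from Table S1.

```python
def relative_uptake(ca_um: float, k_ca_um: float, hill_n: float = 2.0,
                    v_max: float = 1.0) -> float:
    """Hill-type Ca2+ uptake rate at a free Ca2+ concentration given in uM."""
    return v_max * ca_um ** hill_n / (k_ca_um ** hill_n + ca_um ** hill_n)


# Hypothetical parameters chosen only for illustration.
K_CA_CONTROL_UM = 0.25    # SERCA1 alone
K_CA_INHIBITED_UM = 0.50  # SERCA1 plus an inhibitory micropeptide

for ca in (0.1, 0.25, 0.5, 1.0, 10.0, 100.0):
    control = relative_uptake(ca, K_CA_CONTROL_UM)
    inhibited = relative_uptake(ca, K_CA_INHIBITED_UM)
    print(f"{ca:7.2f} uM  control={control:.3f}  inhibited={inhibited:.3f}")
```

With a higher KCa the curve shifts to the right, so uptake is lower at submaximal Ca2+, while both curves converge on the same Vmax at saturating Ca2+. That is the pattern the text describes for MLN, PLN, and SLN.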

Because the concentration of SR Ca2+ is dependent upon
SERCA reuptake activity, we examined if MLN overexpression
could alter SR Ca2+ levels in C2C12 myoblasts. SR Ca2+ levels
were directly measured using retroviral delivery of the fluores-
cence resonance energy transfer (FRET)-based, SR-localized
Ca2+ sensor T1ER, using culture conditions previously shown
to be sensitive to changes in endogenous SERCA activity
(Brandman et al., 2007). Coexpression of MLN or SLN sig-
nificantly decreased the levels of SR Ca2+, measured as a
decrease in T1ER FRET (Figure 3C). We alternatively compared
SR Ca2+ levels using the ratiometric Ca2+ indicator dye fura-2-
AM, following SR Ca2+ release using the RyR agonist 4-chloro-
m-cresol (4-CMC) (Figure 3D). Consistent with our previous
findings, overexpression of MLN or SLN significantly decreased
peak Ca2+ release from the SR (Figure 3D). Together, these
findings demonstrate that MLN inhibits
SERCA pump activity in a manner similar to PLN
and SLN.

MLN, PLN, and SLN Are Expressed 
in Different Striated Muscle Types 

The structural and functional similarities 
between MLN, PLN, and SLN suggest 
they comprise a family of micropeptides 
that regulate Ca2+ handling through mod-
ulation of SERCA activity. To determine if
PLN or SLN are functionally redundant 
with MLN in vivo, we examined their 
expression during developmental and 
adult stages. In the heart, PLN expression 
was detectable in both the atria and 
ventricles, whereas SLN expression was 
specific to the atria (Figures 4A and 
S3A). SLN overlapped with the expres-
sion of MLN in fetal skeletal muscles,
which display a slow phenotype during 
embryonic development (Figure S3A) (Lu et al., 1999). Consistent
with previous reports, quantitative real-time PCR and RNA 
sequencing (RNA-seq) expression analyses revealed that SLN 
was markedly downregulated in most adult skeletal muscles, 
which convert to fast type in the mouse (Figures 4A and S3B) (Tu-
pling et al., 2011). In contrast, MLN was robustly expressed in all
adult skeletal muscles, similar to the expression pattern of 
SERCA1 (Figures 4A and 4B). The expression patterns of PLN 
and SLN were more similar to the expression of SERCA2, which 
is highly expressed in cardiac and slow skeletal muscles (Fig- 
ure 4B). In addition, comparison of RNA-seq expression data re- 
vealed that PLN and SLN are not expressed in differentiated 
C2C12 myotubes, whereas MLN transcripts are robustly ex- 
pressed (Figure S3C). Thus, MLN, PLN, and SLN are differentially 
expressed across vertebrate muscle types, and MLN is the most 
abundant of the three micropeptides expressed in adult skeletal 
muscle of the mouse. 

Transcriptional Control of MLN Expression by MyoD 
and MEF2 

MLN and the skeletal muscle-specific isoforms of SERCA and 
RyR are coregulated by MyoD, suggesting they comprise a 
core genetic module important for Ca2+ handling in skeletal mus-
cle (Fong et al., 2012). Analysis of the 5' flanking region of the
MLN gene revealed highly conserved binding sites for the 
myogenic transcription factors MyoD (E-box) and MEF2 (Fig- 
ure S4A), which bound specifically to these sequences in gel 
Figure 5. Regulation of MLN Transcription by MyoD and MEF2

(A) A fragment of the MLN promoter (-506 to +86, relative to the transcriptional 
start site) containing a highly conserved MyoD E-box (CACCTG) and MEF2 site 
(CTAATAACAG) was cloned in front of the luciferase reporter gene (MLN-Luc). 
The MLN-Luc reporter was robustly transactivated by the skeletal muscle 
transcription factors MyoD and MEF2 in COS7 cells (gray and red bars, 
respectively). Mutation of the E-box (acCCgt) or Mef2 site (CTgggAACAG) 



mobility shift assays (Figure S4B). MyoD and MEF2 indepen- 
dently activated a luciferase reporter linked to the proximal 
MLN promoter (MLN-Luc), and transactivation was lost upon 
mutagenesis of their respective binding sites (Figure 5A). Chro- 
matin immunoprecipitation sequencing (ChIP-seq) data at this 
locus revealed that MyoD bound specifically to this region in 
both C2C12 myoblasts and myotubes (Figure S4C) (Bernstein 
et al., 2012). The activity of the MLN-Luc promoter in C2C12 
myoblasts and myotubes was similarly dependent upon both 
the MEF2 and E-box sequences, which, when mutated, resulted 
in the loss and abrogation of MLN promoter activation, res- 
pectively (Figure 5B). Consistent with this, a LacZ transgene 
controlled by the MLN promoter (MLN-lacZ) displayed skeletal 
muscle-specific expression in vivo and was dependent upon 
the MEF2 and E-box sequences for full promoter activation (Fig- 
ure 5C). Thus, the MLN gene is a direct target of the core tran- 
scription factors that activate skeletal myogenesis. 
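
The binding-site mutations used in the reporter assays can be checked against the site sequences given in the Figure 5 legend (E-box CACCTG mutated to acCCgt; MEF2 site CTAATAACAG mutated to CTgggAACAG). The sketch below is illustrative and not from the study: it tests each E-box sequence against the generic CANNTG consensus and lists which positions were changed. The helper names are hypothetical.

```python
import re

# Generic E-box consensus CANNTG, where N is any base.
EBOX = re.compile(r"CA[ACGT]{2}TG")


def is_ebox(site: str) -> bool:
    """True if the whole site matches the CANNTG consensus (case-insensitive)."""
    return EBOX.fullmatch(site.upper()) is not None


def changed_positions(wild_type: str, mutant: str) -> list[int]:
    """0-based positions at which two equal-length sites differ."""
    if len(wild_type) != len(mutant):
        raise ValueError("sites must have equal length")
    pairs = zip(wild_type.upper(), mutant.upper())
    return [i for i, (a, b) in enumerate(pairs) if a != b]


print(is_ebox("CACCTG"), is_ebox("acCCgt"))
print(changed_positions("CACCTG", "acCCgt"))
print(changed_positions("CTAATAACAG", "CTgggAACAG"))
```

The wild-type E-box fits the consensus and the mutant does not; the lowercase letters in the legend mark the substituted bases, four in the E-box and three in the MEF2 site.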

Generation of MLN Knockout Mice Using TALENs 

To investigate the function of MLN in vivo, we generated MLN 
knockout (KO) mice using TAL effector nuclease (TALEN)-medi- 
ated homologous recombination. A unique TALEN pair specific 
for exon 1 of the MLN locus was designed and constructed using 
the REAL assembly method (Reyon et al., 2012). A loss-of-func- 
tion allele was created using a donor plasmid to insert a red fluo- 
rescent reporter (tdTomato), followed by a triple polyadenylation 
cassette into exon 1 in the MLN locus (Figure 6A). This strategy 
was designed to prematurely terminate transcription upstream 
of the MLN coding sequence but allow for the expression of 
a red fluorescent reporter by the endogenous MLN promoter. 
Correct targeting was verified by Southern blot and PCR-based 
genotyping (Figures 6B and S5A). Detection of tdTomato fluores- 
cence in MLN KO mice was specific to skeletal muscle and was 
not detected in other tissues (Figure 6C). qPCR using primer 
pairs specific to the downstream exons 2 and 3 demonstrated 
that the MLN transcript was absent in skeletal muscle from 
MLN KO mice (Figure 6D). 

MLN KO mice were born at expected Mendelian ratios from 
heterozygous intercrosses and showed no obvious morpholog- 
ical abnormalities or differences in body or muscle weights (Fig- 
ure S5B). Mice lacking PLN and SLN also have nonpathological 
phenotypes but show enhanced Ca2+ handling and contractility
in cardiac and slow skeletal muscle (Luo et al., 1994; MacLennan
and Kranias, 2003; Tupling et al., 2011). 



(indicated by an X) abrogated transactivation by MyoD:E12 heterodimer or
Mef2c, respectively. All luciferase values were normalized to the trans- 
activation of a basal luciferase reporter (TATA-Luc) with MyoD or Mef2, 
respectively. Data are presented as mean ± SEM. 

(B) Luciferase reporter assays showing that MEF2 and E-box enhancer binding 
sites are essential for transactivation of the MLN promoter in C2C12 myoblasts
and myotubes. Data are presented as mean ± SEM. 

(C) X-gal and H&E staining of E10.5 mouse embryos harboring either the MLN
promoter-lacZ transgene (WT) or mutations in the MLN promoter (ΔMef2 or
ΔE-box). The MLN promoter showed expression in the myotomal compart-
ment of the somites (orange arrow) and premyogenic cells in the mandibular
arch (red arrow). Mutation of the MEF2 or E-box sequences in the MLN pro- 
moter-lacZ transgene abrogated or abolished muscle-specific expression. 
See also Figure S4. 
Enhanced Ca2+ Handling and Skeletal Muscle
Performance in MLN KO Mice 

To examine the potential role of MLN in regulating skeletal mus- 
cle performance, we subjected 8-week-old wild-type (WT) and 
MLN KO mice to a regimen of forced treadmill running to exhaus-
tion. Remarkably, MLN KO mice ran an average time of ~31%
longer than their WT littermates, representing a 55% increase 
in running distance (Figures 6E and 6F). Mice with a fast-to- 
slow fiber type switch also show increased running performance 
(van Rooij et al., 2009). However, histological analyses of hind- 
limb muscles, including quadriceps (fast-type) and soleus 
(slow-type) muscles revealed no obvious differences in fiber- 
type identity or myofiber size between WT and MLN KO mice 
(Figures S5C-S5E), indicating that MLN functions through a 
different mechanism to regulate muscle performance. 
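
A ~31% gain in running time alongside a 55% gain in distance implies that treadmill speed was not constant over the run. The sketch below illustrates the arithmetic under a purely hypothetical linear speed ramp. The actual regimen follows van Rooij et al. (2009) and is not described in this section, so the speed model and all parameter values are assumptions.

```python
def distance(t: float, v0: float, accel: float) -> float:
    """Distance covered by time t when speed rises linearly: v(t) = v0 + accel * t."""
    return v0 * t + 0.5 * accel * t * t


def distance_gain(time_gain: float, t_ref: float, v0: float, accel: float) -> float:
    """Fractional gain in distance for a given fractional gain in running time."""
    longer = distance(t_ref * (1.0 + time_gain), v0, accel)
    return longer / distance(t_ref, v0, accel) - 1.0


TIME_GAIN = 0.31  # ~31% longer running time, from the text

# Two limiting cases in arbitrary units: constant speed, and a ramp from zero.
constant_speed = distance_gain(TIME_GAIN, t_ref=1.0, v0=1.0, accel=0.0)
pure_ramp = distance_gain(TIME_GAIN, t_ref=1.0, v0=0.0, accel=1.0)

print(round(constant_speed, 3), round(pure_ramp, 3))
```

Under these assumptions the distance gain ranges from 31% at constant speed to about 72% for a ramp starting from zero, so the reported 55% is consistent with a protocol in which speed increases during the run.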




Figure 6. Generation and Characterization 
of MLN Knockout Mice 

(A) TALEN-mediated homologous recombination 
was used to insert a tdTomato fluorescent reporter 
and triple polyadenylation cassette into exon 1 of 
the MLN locus to generate a null allele. A sche- 
matic of the donor vector and targeting strategy is 
shown. P, PvuII.

(B) Southern blot analysis confirming correct tar- 
geting of the tdTO-triple polyadenylation cassette 
into exon 1 of the MLN locus using probes 5' and 3' 
to the TALEN cut site. 

(C) tdTomato fluorescence was specific to skeletal 
muscle (white arrowhead) of MLN KO mice and not 
detected in other tissues, such as tendon (blue 
arrowhead). 

(D) Real-time PCR using primers specific to exons 
2 and 3 demonstrating absence of MLN tran- 
scripts in MLN KO muscle, as well as in liver. 

(E) Muscle performance was measured using 
forced treadmill running to exhaustion. MLN KO 
(n = 15) mice ran ~31% longer than WT littermates
(n = 14).

(F) Comparison of distance run by MLN KO and 
WT mice in Figure 6E. 

(G) Myoblasts isolated and cultured from MLN KO 
hindlimb muscles were imaged using Fura-2-AM 
and treated with the RyR agonist 4-CMC in the 
absence of extracellular Ca2+ to indirectly mea-
sure SR Ca2+ levels. MLN KO myoblasts showed
significantly increased SR Ca2+ levels, measured
as peak Ca2+ release from the SR. Data are pre-
sented as mean ± SEM (*p < 0.05 compared to WT
as in E and F and pBx-empty in G). 

See also Figure S5. 
Next, we investigated whether SR Ca2+
levels were altered in primary myoblasts
isolated from hindlimb muscles of MLN
KO versus WT littermates. SR Ca2+ levels
were measured using the calcium indica-
tor fura-2-AM, with SR Ca2+ release triggered by
the addition of the RyR agonist 4-CMC.
Strikingly, MLN KO myoblasts showed
significantly increased SR Ca2+ levels
compared to WT myoblasts, measured
as peak SR Ca2+ release (Figure 6G). This increase occurred
without changes in RyR1 or SERCA1 expression
between WT and MLN KO muscles (Figure S5F), consistent
with the function of MLN as an inhibitor of SERCA activity.
Together, these findings support the function of MLN as the
dominant regulator of SERCA activity in adult skeletal muscle.

DISCUSSION 



SERCA is a key regulator of striated muscle performance,
serving as the major Ca2+ ATPase responsible for the reuptake
of cytosolic Ca2+ into the SR (Figure 7A). Direct modulation of
SERCA pump activity by the micropeptides PLN and SLN regu-
lates muscle contractility by diminishing the rate of Ca2+ reup-
take into the SR. However, because PLN and SLN are not
Figure 7. A Family of SERCA-Inhibitory Micropeptides

(A) RyR and SERCA play a critical role in muscle contractility by controlling 
Ca2+ cycling between the cytosol and SR. MLN, PLN, and SLN inhibit SERCA
pump activity in different striated muscle types of vertebrates. 

(B) Illustration of the family of SERCA-inhibitory micropeptides. The discovery 
of MLN reveals that vertebrates encode three SERCA inhibitory peptides 
that share conserved residues within their transmembrane alpha helices. 
Green shading denotes identical residues, and blue shading denotes similar 
residues. MLN, myoregulin; PLN, phospholamban; SLN, sarcolipin; SCL, 
sarcolamban. 



expressed in most adult skeletal muscles of mice, the possible 
existence of other SERCA regulatory factors has remained an 
open question. 

Our biochemical and in vivo results show that MLN forms 
a stable complex with SERCA in the membrane of the SR 
and that MLN directly influences SR Ca2+ levels and maximal



exercise performance. The robust skeletal muscle-specific ex- 
pression of MLN, resulting from direct transactivation by the 
myogenic transcription factors MyoD and MEF2, further highlights
MLN as the predominant SERCA regulating micropeptide in 
adult skeletal muscle. The discovery of MLN reveals a universal 
mechanism for the control of SERCA activity by a family of 
related micropeptides expressed in different striated muscle 
types in vertebrates (Figure 7A). 

Micropeptides Can Be Concealed within RNAs 
Misannotated as Noncoding 

Micropeptides remain highly underrepresented in genome anno- 
tations due in large part to the difficulty in identifying functional 
short ORFs in RNA transcripts. Recent advances in bioinformatic 
and biochemical methodologies have revealed that lncRNAs
may harbor concealed micropeptides; however, only a few 
have been functionally verified and characterized in vivo. Here, 
we have identified and characterized the function of a conserved 
micropeptide in vertebrates that functions as an important regu-
lator of skeletal muscle performance through modulation of Ca2+
handling by SERCA. Given the attention that has been focused 
on the control of SERCA activity and muscle function over the 
years, it is remarkable that such a key regulator of these pro- 
cesses has gone undetected. Undoubtedly, this is because the 
MLN ORF is concealed in an RNA annotated as noncoding. 
Our discovery of MLN was possible due to its high sequence 
conservation in vertebrates and the presence of an identifiable 
functional domain, a type II transmembrane alpha helix. We 
found that the MLN alpha helix shares residues in common 
with PLN and SLN; however, common bioinformatic search tools 
alone are not sufficient to identify the relatedness of MLN, PLN, 
and SLN. Interestingly, both PLN and SLN were first discovered 
as micropeptides and subsequently mapped to RNA transcripts 
(Kirchberber et al., 1975; Wawrzynow et al., 1992). The PLN and 
SLN genes share a genomic architecture similar to that of MLN, 
in which the small ORFs are encoded in the 5' region of the ter- 
minal exon. Indeed, the transcripts encoding PLN and SLN may 
have also been annotated as noncoding, if not for their prior dis- 
covery as micropeptides. 

A Family of SERCA-inhibitory Micropeptides 

In Drosophila, the invertebrate ortholog of SERCA is encoded by 
a single gene (Ca-P60A) and is modulated by the recently iden- 
tified micropeptide sarcolamban (SCL) (Magny et al., 2013). In 
vertebrates, the SERCA family has expanded to encode three 
genes (SERCA1-3) that give rise to multiple alternate splice var- 
iants with differing kinetic properties and expression patterns 
(Anger et al., 1994; Periasamy and Kalyanasundaram, 2007). 
The expansion of vertebrate gene families occurred through 
whole-genome and gene-duplication events, resulting in the for- 
mation of paralogous gene families. Combined with the tissue- 
specific expression patterns of the individual SERCA isoforms, 
the differential expression of MLN, PLN, and SLN likely contrib-
utes to the unique Ca2+ handling and contractile properties of
different striated muscle types in vertebrates. In addition, the 
extent to which MLN, PLN, and SLN partially overlap in their 
expression in different muscle types may influence the calcium 
kinetics and performance of these tissues. Coexpression of 
PLN and SLN in atrial cardiomyocytes and slow skeletal muscle
has been shown to result in the superinhibition of SERCA pump 
activity (MacLennan et al., 2003). MLN expression overlaps with 
that of PLN and SLN in adult slow skeletal muscle and with SLN 
in developing skeletal muscles. Future biochemical and animal 
studies are required to determine the extent to which MLN syn- 
ergizes with PLN and SLN in coregulating SERCA activity in 
slow-type and developing skeletal muscle. SLN expression 
in larger animals is more widespread than in rodents, occurring 
in both fast- and slow-type muscles (Fajardo et al., 2013; Oder- 
matt et al., 1997). A synergistic interaction between MLN and 
SLN may be more biologically relevant in the adult tissues of 
these species; however, the relative expression patterns of these 
two genes in these species remain to be determined. 

Apart from spatiotemporal differences in expression, the 
SERCA-regulatory peptides differ in size and presence of addi- 
tional secondary structures. A schematic illustration of the family 
of SERCA-inhibitory micropeptides is shown in Figure 7B. Both 
SLN and SCL lack extended N-terminal regions and additional 
secondary structures other than their transmembrane alpha he- 
lices. MLN and PLN have expanded N-terminal cytoplasmic re- 
gions that encode a beta sheet and alpha helix, respectively. The 
N-terminal sequence of PLN has been shown to be critical for its 
function, and phosphorylation of serine-16 by protein kinase A 
(PKA) or threonine-17 by Ca2+/calmodulin-dependent protein ki-
nase II (CaMKII) diminishes the ability of PLN to inhibit SERCA
activity (Mattiazzi et al., 2006; Wegener et al., 1989). The N-ter- 
minal sequence of MLN could serve as a similar regulatory 
domain through phosphorylation, as this sequence contains 
multiple serine and threonine residues. Deciphering the physio- 
logical signaling pathways that regulate MLN expression and 
function will be important to fully understand its role in skeletal 
muscle development and disease. 

Future Questions 

Defects in Ca2+ signaling underlie the pathogenesis of many
muscle diseases that arise from mutations in components of
Ca2+ signaling pathways, as well as diseases that arise from a
loss of myofiber structural integrity (Berchtold et al., 2000; Millay
et al., 2008). Given the importance of SERCA pump activity in
regulating Ca2+ handling and the pathogenesis of skeletal mus-
cle diseases, such as Brody myopathy and muscular dystro-
phies (Allen et al., 2010; Goonasekera et al., 2011; Odermatt
et al., 1996), the discovery of MLN opens interesting possibilities
for the modulation of these pathways. Considering the enhanced 
exercise capacity of MLN KO mice, it is interesting to speculate 
that pharmacologic approaches to disrupt the association of 
MLN with SERCA might have similar salutary effects. Modulation 
of SERCA activity in skeletal muscle has also been implicated in 
the control of systemic energy homeostasis (Bal et al., 2012), 
raising the interesting possibility that MLN may exert additional 
metabolic functions. Considering that SERCA is also important 
in Ca2+ regulation in nonmuscle cell types in which MLN, PLN,
and SLN are not expressed, it is interesting to speculate that
additional SERCA-modulating micropeptides may be concealed
within putative lncRNAs expressed in other tissues. Finally, the
discovery of MLN as a previously unrecognized regulator of 
muscle function suggests that the microproteome, which is 



largely unexplored, represents a reservoir for future biological 
insights. 

EXPERIMENTAL PROCEDURES 

TALEN-Mediated Homologous Recombination in Mice 

A unique TALEN pair specific for the MLN locus was designed using the ZiFiT 
Targeter Program (http://zifit.partners.org/ZiFiT/Introduction.aspx) and con-
structed using the REAL Assembly Kit (Addgene) (Reyon et al., 2012). A donor
vector containing the tdTomato reporter and triple polyadenylation sequences 
was constructed by incorporating short 5' and 3' homology arms specific to 
the MLN locus. TALEN mRNAs were in vitro transcribed using the mMessage 
mMachine T7 Ultra Kit (Life Technologies), diluted to 25 ng μl⁻¹, and coinjected
with 3 ng μl⁻¹ of the circular DNA donor plasmid into the nucleus and cyto-
plasm of one-cell-stage embryos (B6C3F1), which were then transferred into pseudopreg-
nant female ICR mice.

Study Approval 

All experimental procedures involving animals in this study were reviewed and 
approved by the University of Texas Southwestern Medical Center’s Institu- 
tional Animal Care and Use Committee. 

Radioisotopic In Situ Hybridization 

In situ hybridizations were performed as previously described (Shelton et al., 
2000). See the Extended Experimental Procedures for a more detailed proto- 
col. The primer sequences used to clone MLN, PLN, and SLN cDNA templates 
are listed in Table S2. 

CRISPR/Cas9-Mediated Homologous Recombination in C2C12 
Myoblasts 

A single-guide RNA (sgRNA) specific to the C-terminal coding sequence of the 
mouse MLN locus was cloned into the sgRNA/Cas9 expression vector px330 
(MLN-FLAG-px330). The donor vector was constructed with a single FLAG 
epitope tag in-frame with the MLN coding sequence flanked by ~500 base
pair homology arms specific to the MLN locus. MLN-FLAG knockin clones 
were generated by transient cotransfection and expanded from single-cell 
clones. Detection of endogenous MLN-FLAG peptide was performed by 
immunoblotting with a rabbit anti-FLAG antibody (Sigma) on protein lysates 
immunoprecipitated with mouse anti-FLAG agarose beads (Sigma). 

Treadmill Running 

In blinded studies, male MLN KO and WT littermate mice were subjected to 
forced exercise on a treadmill (Exer-6M, Columbus Instruments, 10% incline) 
at 8 weeks of age using a regimen previously described (van Rooij et al., 2009). 
See the Extended Experimental Procedures for a more detailed protocol. 

Intracellular Ca2+ Imaging

Cytosolic Ca2+ levels were measured as described previously (Liou et al., 2005),
with the exception that retroviral-transduced C2C12 cells or primary myoblasts
were plated on Ibidi μ-35 mm tissue-culture dishes and cultured for 24 hr in low
Ca2+ (0.1 mM Ca2+) prior to imaging to increase sensitivity to changes in SR Ca2+
levels (Brandman et al., 2007). SR Ca2+ levels were directly measured using
T1ER as previously described (Abell et al., 2011; Tsai et al., 2014). See the
Extended Experimental Procedures for a more detailed protocol.

Oxalate-Supported Ca2+ Uptake Measurements in HEK293 Lysates

Oxalate-supported Ca2+-dependent Ca2+-ATPase activity in homogenates
was measured by a modification of the Millipore filtration technique as 
described previously (Holemans et al., 2014; Luo et al., 1994). HEK293 cells 
were cotransfected with equal amounts of an expression plasmid encoding 
mouse SERCA1 and an expression plasmid encoding MLN, PLN, or SLN. 
See the Extended Experimental Procedures for a more detailed protocol. 

Coimmunoprecipitations and Western Blot Analysis 

Coimmunoprecipitations (coIPs) were performed as previously described
(Anderson et al., 2009). Tris-tricine-SDS-PAGE was carried out using 16.5%
Tris-tricine gels (BioRad) and Tris-tricine-SDS running buffer (BioRad). See the
Extended Experimental Procedures for a more detailed protocol. 

Real-Time PCR 

Total RNA was prepared from whole muscles using Trizol (Invitrogen) and 
treated with DNase prior to reverse transcription by Superscript III (Invitrogen). 
Real-time PCR was performed using TaqMan probes (ABI) or SYBR green using 
primers in Table S2. TaqMan probes include Mck (Mm00432556_m1), Mef2c 
(Mm01340842_m1), SLN (Mm00481536_m1), PLN (Mm00452263_m1), and
SERCA2 (Mm01201434_m1). Primers for SYBR green reactions are listed in 
Table S2. 

Electrophoretic Mobility Shift Assays 

Electrophoretic mobility shift assays (EMSAs) were performed as previously 
described (Anderson et al., 2012) using recombinant myc-tagged proteins ex- 
pressed in COS7 cells and double-stranded EMSA probes created by annealing 
complementary oligonucleotides. Probe sequences are listed in Table S2. 
Supershifts were performed by adding 1 µg of mouse anti-Myc antibody
(Invitrogen). See the Extended Experimental Procedures for a more detailed protocol.

Luciferase Assays 

Luciferase assays were performed as previously described (Anderson et al., 
2009). Luciferase activity was measured using a FluoStar OPTIMA microplate 
reader (BMG Labtech) and normalized to beta-galactosidase activity using the 
FluoReporter LacZ/Galactosidase Quantitation Kit (Invitrogen). See the 
Extended Experimental Procedures for a more detailed protocol. 

Subcellular Fractionation 

C2C12 myoblasts infected with a retrovirus encoding the HA-MLN fusion protein
(pBabeX-HA-MLN) were fractionated as previously described (Millay et al.,
2013). See the Extended Experimental Procedures for a more detailed 
protocol. 

Northern and Southern Blot Analysis 

Northern blots were performed using a commercially prepared adult mouse 
multitissue RNA blot (MN-MT-1; Zyagen) hybridized with a radiolabeled DNA 
probe specific to the full-length MLN transcript. Radiolabeled DNA probes 
for northern and Southern blots were generated using a RadPrime Kit (Invitro- 
gen) (Table S2). 

Histology and Immunohistochemistry 

Skeletal muscle tissues were dissected and fixed overnight in 4% formaldehyde
in PBS prior to paraffin embedding and sectioning using routine procedures.
Immunohistochemistry was performed on deparaffinized sections
using a HistoMouse-Plus kit (Invitrogen) using primary antibodies specific to
fast (MY32; Sigma) and slow (NOQ7.54; Sigma) myosins. Wheat germ agglu- 
tinin (WGA) staining was performed using Alexa Fluor 555-conjugated WGA 
(Invitrogen) as described previously (Liu et al., 2011). 

Analysis of RNA-Seq Expression Data 

Raw data for C2C12 cells and triceps brachii muscle were downloaded from 
the Short Read Archive (SRP002119 and SRP008123, respectively). Reads 
were mapped to the UCSC mm9 genome annotation using TopHat, and align- 
ments were processed to bigWig coverage maps and viewed using the UCSC 
genome browser. 
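
As an illustration of what a coverage map represents, the following minimal Python sketch converts aligned read intervals into per-base depth and then into bedGraph-style rows. It is not the pipeline used here (TopHat alignment followed by bigWig conversion); the function names, the toy intervals, and the chromosome label `chrToy` are hypothetical, and intervals are assumed to be 0-based and half-open.

```python
def coverage_track(read_intervals, region_length):
    """Per-base read depth from 0-based, half-open (start, end) intervals."""
    diff = [0] * (region_length + 1)  # difference array: +1 at start, -1 at end
    for start, end in read_intervals:
        start = max(start, 0)
        end = min(end, region_length)
        if start < end:
            diff[start] += 1
            diff[end] -= 1
    depth, running = [], 0
    for delta in diff[:region_length]:
        running += delta
        depth.append(running)
    return depth


def to_bedgraph(chrom, depth):
    """Collapse runs of equal, non-zero depth into (chrom, start, end, value) rows."""
    rows = []
    run_start = 0
    for pos in range(1, len(depth) + 1):
        if pos == len(depth) or depth[pos] != depth[run_start]:
            if depth[run_start] > 0:
                rows.append((chrom, run_start, pos, depth[run_start]))
            run_start = pos
    return rows


# Two hypothetical reads overlapping on positions 2-3 of an 8-bp toy region.
toy_depth = coverage_track([(0, 4), (2, 6)], 8)
toy_rows = to_bedgraph("chrToy", toy_depth)
```

A run-length representation of this kind is what browser coverage formats store; the binary indexing that makes bigWig files fast to query is not reproduced in this sketch.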

Circular Dichroism Spectroscopy 

Circular dichroism (CD) spectroscopy measurements were performed using a 
JASCO J-815 spectrometer on in vitro-synthesized full-length MLN (Peptide 
2.0). The secondary structure elements were calculated using Yang’s fit. The 
root-mean-square deviation for the observed and calculated CD spectra 
values was 3%. See the Extended Experimental Procedures for a more 
detailed protocol. 

Structural Modeling and Automated Protein Docking 

The helical domains of MLN, PLN, SLN, and SCL were created ab initio using 
I-TASSER (Zhang, 2008). Automated protein docking of the MLN model with
the crystal structure of SERCA1 (4H1W) was performed using ClusPro 2.0
(Comeau et al., 2004).

Adult Muscle Electroporation and Imaging 

Flexor digitorum brevis (FDB) muscles of 12-week-old male mice were electroporated
as previously described (Nelson et al., 2013), with expression vectors
encoding N-terminal GFP fusions to MLN, PLN, or SLN. Unfixed FDB muscles 
were examined directly using two-photon laser scanning microscopy (Zeiss; 
LSM 780), with reverse second harmonic generation to visualize the A bands 
as an internal reference. See the Extended Experimental Procedures for a 
more detailed protocol. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, five 
figures, and two tables and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2015.01.009.
SUMMARY

EBER2 is an abundant nuclear noncoding RNA ex- 
pressed by the Epstein-Barr virus (EBV). Probing its 
possible chromatin localization by CHART revealed 
EBER2’s presence at the terminal repeats (TRs) of 
the latent EBV genome, overlapping previously iden- 
tified binding sites for the B cell transcription factor 
PAX5. EBER2 interacts with PAX5 and is required 
for the localization of PAX5 to the TRs. EBER2 knock- 
down phenocopies PAX5 depletion in upregulating 
the expression of LMP2A/B and LMP1, genes near- 
est the TRs. Knockdown of EBER2 also decreases 
EBV lytic replication, underscoring the essential 
role of the TRs in viral replication. Recruitment of 
the EBER2-PAX5 complex is mediated by base-pair- 
ing between EBER2 and nascent transcripts from the 
TR locus. The interaction is evolutionarily conserved 
in the related primate herpesvirus CeHV15 despite 
great sequence divergence. Using base-pairing 
with nascent RNA to guide an interacting transcription
factor to its DNA target site is a previously undescribed
function for a trans-acting noncoding RNA.

INTRODUCTION 

Epstein-Barr virus (EBV) is a human lymphotropic gamma-1 
herpesvirus (or lymphocryptovirus, LCV) that expresses two 
noncoding RNAs called EBER1 (EBV-encoded RNA 1) and 
EBER2, which are 167 and 173 nucleotides (nts) long, respec- 
tively (Lerner et al., 1981). They are expressed during all forms 
of EBV latency and also during lytic growth (Greifenegger 
et al., 1998; Rowe et al., 2009). EBER1 accumulates to ~10^6
and EBER2 to ~2.5 × 10^5 copies per infected cell (Moss and
Steitz, 2013). The high copy number and the evolutionary con- 
servation of EBERs in related primate LCVs point to a funda- 
mental role of EBERs in the EBV life cycle (Howe and Shu, 
1988; Rivailler et al., 2002b). To elucidate their function, recom- 
binant EBV strains lacking EBERs have been engineered and 
introduced into host B lymphocytes. However, conflicting obser- 
vations regarding possible effects on B cell growth and transfor- 
mation have been reported (Gregorovic et al., 2011; Swamina- 
than et al., 1991; Yajima et al., 2005). Thus, the physiological 
roles of EBERs have remained unclear. Likewise, only limited 
mechanistic insights have been gained from investigating the
interacting partners of these noncoding RNAs. Both EBER1 
and EBER2 bind the host RNA chaperone protein La (Lerner 
et al., 1981), whereas ribosomal protein L22 and AUF1 (AU- 
rich element binding factor 1)/hnRNP D (heterogeneous nuclear 
ribonucleoprotein D) additionally associate with EBER1 (Lee 
et al., 2012; Toczyski et al., 1994). A hallmark of EBV is its tumorigenic
potential, and several EBV latent proteins have been
shown to contribute to oncogenicity (Young and Rickinson, 
2004). Intriguingly, EBERs by themselves, particularly EBER1, 
can cause tumors under certain conditions, but the exact molecular
mechanism has not been elucidated even though EBER1’s
interaction with L22 has been implicated in the process (Hou- 
mani et al., 2009; Komano et al., 1999; Repellin et al., 2010). 
The unidentified function(s) of EBERs must be confined to the 
nucleus, as they exhibit strictly nuclear localization and do not 
undergo nucleo-cytoplasmic shuttling (Fok et al., 2006). 

The EBV genome in virions is linear, flanked on both ends by 
tandem terminal repeats (TRs). These direct repeat units are 
538 and 544 base pairs (bps) long (in EBV type I and II strains, 
respectively) and contain high GC content (78%). Each viral 
genome contains a varying number of up to 20 TRs (Brown 
et al., 1988). Upon infection of a host cell, the viral genome circularizes
at the TRs, possibly through a recombination event, and
amplifies as multicopy episomes during latency (Lindahl et al., 
1976; Sugden et al., 1979). Following genome circularization, 
the promoter region and exons of LMP2 (Latent Membrane Pro- 
tein 2), located at opposite ends of the linear genome, become 
juxtaposed to allow the expression of both LMP2A and LMP2B 
isoforms, which differ by alternative promoter usage (Raab- 
Traub and Flynn, 1986). Both isoforms modulate B cell receptor 
signal transduction to prevent premature lytic reactivation (Miller 
et al., 1994; Rovedo and Longnecker, 2007). During lytic replica- 
tion, the circular EBV genome is amplified, giving rise to long 
concatemers that are subsequently processed into unit length 
genomes. Again, processing occurs in the TR region by a 
proposed recombination event in conjunction with enzymatic 
cleavage by a so-called terminase complex (Chiu et al., 2014; 
Zimmermann and Hammerschmidt, 1995). The TRs further pro- 
vide an essential sorting signal for the linear genome to be pack- 
aged into capsids (Feederle et al., 2005). These observations un- 
derscore the important role(s) of the TR regions in EBV genome 
organization during both latency and the lytic cycle. The latency- 
lytic switch is subject to tight regulation, as an expanding body of 
evidence indicates that lytic replication contributes to oncogen- 
esis (Katsumura et al., 2012; Ma et al., 2011). 
Figure 1. EBER2 Localizes to the TRs of the EBV Genome

(A) Secondary structure model of EBER2. RNase H-sensitive regions (shown in B) are indicated in green. The region hybridizing to the ASO used in CHART is 
underlined. 

(B) Northern blot of EBER2 after RNase H digestion using DNA oligonucleotides complementary to EBER2. The numbers on top correspond to the nucleotides 
targeted in EBER2. An arrow indicates the mobility of full-length EBER2 RNA. U6 RNA serves as a loading control. 

B lymphocytes are the major cell type infected by EBV. The
transcription factor PAX5 (Paired box protein 5) has been 
described as a master regulator of B lymphocyte development 
through promoting the expression of B cell specific genes and 
repressing B-lineage inappropriate genes (Medvedovic et al., 
2011). Specific DNA binding of PAX5 is achieved through the 
conserved paired box DNA-binding motif. Intriguingly, a recent 
study has shown that PAX5 binds to the TRs of EBV, perhaps 
to coordinate viral genome organization (Arvey et al., 2012). 
PAX5 has further been shown to regulate EBV latent gene 
expression, as depletion of PAX5 results, for example, in upregulation
of the expression of LMP1, the main transforming protein
of EBV that acts as a classical oncogene, and of both LMP2 
isoforms (Arvey et al., 2012). 

Here, we performed capture hybridization analysis of RNA tar- 
gets (CHART) for EBER2, a method comparable to chromatin 
immunoprecipitation (ChIP), but in CHART the chromatin localization
of an RNA of interest is assayed (Simon et al., 2011). In lieu
of an antibody in ChIP experiments, CHART employs antisense 
oligonucleotides (ASOs) that hybridize to accessible regions of 
an RNA for selection (Figure S1). We report that EBER2 localizes
to the TRs of the latent EBV genome and provide in vivo evidence 
that its recruitment involves an RNA-RNA interaction with 
nascent RNA transcripts. This process in turn is required for effi- 
cient association of PAX5 with its target sites within the TRs. 
Perturbation of EBER2-PAX5 localization affects expression of 
genes nearest its binding site as well as lytic viral DNA replica- 
tion, with possible downstream effects on oncogenic processes. 

RESULTS 

EBER2 Co-Localizes with PAX5 to the TRs of the EBV 
Genome 

To identify an accessible region in EBER2 that could be targeted 
by an ASO for selection in CHART (Simon et al., 2011), we added
~30-nt-long DNA oligonucleotides complementary to EBER2 to 
lysate from EBV-positive BJAB-B1 cells (which contain type II 
EBV). Formation of DNA-RNA hybrids at accessible regions in 
EBER2 induces cleavage by endogenous RNase H. Two such re- 
gions in EBER2 (nucleotides 47-70 and 101-124) were detected 
by Northern blot analysis (Figures 1A and 1B). To select EBER2,
we therefore coupled to agarose beads an RNA ASO targeting 
nucleotides 101-124, as the secondary structure of this region 
is predicted to form an extensive loop (Figure 1A).
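
The design of an antisense oligonucleotide against a chosen window of an RNA can be sketched in a few lines of Python. This is only an illustration of the reverse-complement step, assuming 1-based, inclusive nucleotide coordinates as used in the text; the function names are hypothetical, and the 30-nt `toy_rna` is a placeholder sequence, not EBER2.

```python
def reverse_complement_rna(seq):
    """Return the RNA reverse complement of an RNA sequence written 5'->3'."""
    pairs = {"A": "U", "U": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def antisense_oligo(target_rna, start, end):
    """Antisense RNA oligo (5'->3') for 1-based, inclusive positions start..end."""
    if not (1 <= start <= end <= len(target_rna)):
        raise ValueError("window lies outside the target RNA")
    return reverse_complement_rna(target_rna[start - 1:end])


# Hypothetical placeholder sequence, NOT the EBER2 sequence.
toy_rna = "GGACAGCCGUUGCCCUAGUGGUUUCGGACA"
toy_aso = antisense_oligo(toy_rna, 11, 20)
```

Whether a window is actually accessible in the folded RNA cannot be read from the sequence alone, which is why accessibility was mapped experimentally by RNase H cleavage before an ASO was chosen.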

We then used CHART to identify EBER2 binding sites on 
chromatin in the EBV-positive BJAB-B1 cell line; the isogenic 
EBV-deficient BJAB cell line served as a negative control (Figure S1).
Deep sequencing libraries from both cell lines were prepared
after CHART and subjected to Illumina massively parallel
sequencing. When the sequencing reads were mapped to the 
host cell genome, no obvious EBER2 peaks were present in in- 
fected BJAB-B1 cells compared to BJAB cells (data not shown). 
However, prominent EBER2 binding sites mapped to the 3' end 



of the annotated EBV genome (Figure 1C, bottom, bracketed re- 
gion). Since very few sequence reads from control BJAB cells 
map to the EBV genome (Figure 1C, top), these peaks are un- 
likely to represent host sequences that misalign with viral DNA. 
A zoomed-in view shows that EBER2 localizes to the TR regions 
of the EBV genome (Figure 1D, top), its profile strikingly overlapping
published ChIP data for the transcription factor PAX5 (Figure 1D).
Because TRs represent tandem repeat sequences, as
for PAX5 (Arvey et al., 2012), we cannot distinguish whether 
EBER2 binds to only one specific TR or whether it is equally 
distributed across all TRs, as depicted here. 

Given their co-localization on EBV chromatin, we asked 
whether EBER2 and PAX5 interact with each other. Co-immuno- 
precipitation after in vivo formaldehyde crosslinking using anti- 
PAX5 antibody showed that EBER2 interacts with PAX5, while 
EBER2 was not co-precipitated using an immunoglobulin G 
(IgG) control antibody (Figures 2A and S2A). A reciprocal exper- 
iment was performed using an EBER2 ASO (complementary 
to nucleotides 101-124) that should select EBER2-associated 
proteins. As shown by western blot analysis, PAX5 was enriched 
by the EBER2 ASO, while a control ASO against EBER1 failed to 
capture PAX5 (Figures 2B and S2B). We asked whether EBER2 
interacts directly with PAX5 by performing an RNA immuno- 
precipitation (IP) assay under denaturing conditions after UV 
crosslinking (Lee et al., 2012). EBER2 did not precipitate with 
anti-PAX5 antibody (Figure S2C), consistent with the fact that 
EBER2 does not exhibit a band-shift in the presence of recom- 
binant Pax5 in electrophoretic mobility shift assays (EMSAs) 
(Figures S2D-S2G). Together, these results suggest that 
EBER2 and PAX5 interact, but the association may be indirect. 

Based on its interaction with PAX5, we reasoned that EBER2 
might act in concert with PAX5 to regulate EBV-latent genes. 
Therefore, we knocked down EBER2 using chimeric ASOs that 
induce endogenous RNase H-mediated degradation (Table S1)
(Ideue et al., 2009) and assessed the mRNA levels of several 
EBV-latent genes by qRT-PCR. Two knockdown ASOs (KD 
ASOs) that target the available regions in EBER2 (nucleotides 
101-124 and 39-62; Figures 1A and 1B) efficiently depleted
EBER2 to less than 20% of its original level upon nucleofection 
(Figure 2D). As latent gene expression at 48 hr post-nucleofec- 
tion did not change (data not shown), we introduced a second 
KD ASO nucleofection step at 48 hr and harvested the cells after 
3 days of depletion (Figure 2C). This procedure was necessary to 
maintain EBER2 at less than 20% of its original level because
EBER2 levels increased from ~18% at 24 hr to ~46% at 48 hr 
after a single knockdown. 

Upon EBER2 depletion, we observed that expression of 
LMP2A, and to a lesser extent LMP1 and LMP2B, was upregu- 
lated (Figure 2E), phenocopying the results of PAX5 knockdown 
(Arvey et al., 2012). We observed no significant change for other 
EBV genes (e.g., EBNA1 and BZLF1), as reported for PAX5 
depletion (Arvey et al., 2012), possibly because we examined 
the RNA levels at an earlier time point (after 3 days of EBER2 



(C) EBER2-CHART results from BJAB and BJAB-B1 cells. Deep sequencing reads were mapped to the entire EBV genome (x axis); the number of sequence reads 
is plotted on the y axis. Several EBV genes and the C promoter region are indicated. White boxes for BZLF1 and LMP1 indicate reverse gene orientation. 

(D) EBER2-CHART peaks in the TR region (bracket in C) and the PAX5 ChIP profile (from Arvey et al. 2012) are shown. 

See also Figure S1 and Table S1.
Figure 2. EBER2 Interacts with PAX5 and Is Required for Efficient PAX5 Binding to TRs

(A) Northern blot of EBER2 after IP with IgG (control) or anti-PAX5 antibody after formaldehyde crosslinking (+FA). In, 5% input; S, 5% supernatant; IP, 100%. 

(B) EBER1 and EBER2 ASOs were used to pull down associated proteins, followed by western blot using anti-PAX5 antibody (top). An arrow indicates PAX5; an 
asterisk indicates a non-specific band. In, 10% input; S, 10% supernatant; B, 100% beads. The same samples were subjected to northern blot analysis to detect
EBER2 (bottom). Quantifications of (A) and (B) are shown in Figures S2A and S2B.

(C) Experimental outline for EBER2 knockdown. 

(D) Northern blot for EBER2 was carried out after EBER2 knockdown with two different KD ASOs (complementary to nucleotides 101-124 and 39-62) that target
the nucleotides indicated in green in Figure 1A. The same blot was probed for U6 as a loading control.

(E) RNA levels of several EBV genes were assessed by qRT-PCR after EBER2 knockdown. 

(F) The LMP locus of the episomal EBV genome. LMP1 is transcribed in the opposite direction to LMP2. The variable number of TRs is indicated by (n). 

(G) PAX5 localization at the TRs after EBER2 knockdown was measured by ChIP-qPCR. The cellular CD79a promoter region, a known PAX5 target site, served as 
a positive ChIP control. The C promoter region of the EBV genome (Cp), an active promoter region not bound by PAX5, was the negative control. All data represent
the mean of three independent experiments ± SD; **p = 0.008 (Student’s t test; n = 3). 

See also Figure S2 and Table S1.



depletion as compared to after 5 days of PAX5 depletion) in an 
effort to reduce potential secondary effects of knockdown. Since 
the TRs are located in the first intron of LMP2A/B, positioned 
close to the transcription start site of LMP1 (Figure 2F), it is not 



unexpected for these genes to be most affected by lack of 
EBER2 localization to the TRs. 

Finally, we asked whether EBER2 knockdown would affect 
PAX5 recruitment to the TRs. As shown by PAX5 ChIP, EBER2 
depletion specifically reduced PAX5 localization at the TRs,
whereas its binding was unaltered at a cellular PAX5 target, the 
CD79a promoter region (Figure 2G) (Revilla-i-Domingo et al.,
2012). In summary, our results show that EBER2 is required for
PAX5 recruitment to the TRs and synergizes with PAX5 to regu- 
late certain EBV latent genes. 

Base Pairing of EBER2 to Nascent LMP2 Transcripts 
Recruits PAX5 to the TRs 

The TR sequence in EBV DNA contains two strong PAX5 
consensus sequences (Arvey et al., 2012), suggesting that 
recruitment of the EBER2-PAX5 complex could be achieved 
through the DNA binding domain of PAX5. To test whether 
EBER2 is localized to the TRs by virtue of its interaction with 
PAX5, we knocked down PAX5 using lentivirally expressed short 
hairpin RNAs (shRNAs) as described previously (Arvey et al., 
2012). Efficient depletion of PAX5 in BJAB-B1 cells was con- 
firmed by western blot (Figure 3A); the knockdown efficiency 
(to ~40% of the original level) was sufficient to elicit upregulation 
of LMP2A as measured by qRT-PCR (Figure 3B), arguing that the 
transcriptional control imposed by EBER2-PAX5 was compro- 
mised under these conditions. As expected upon depletion, 
PAX5 localization assessed by ChIP was significantly reduced 
at both the TRs and the CD79a promoter region (Figure 3C). 
However, surprisingly, EBER2 localization at the TRs remained 
unaffected as measured by CHART coupled to qPCR analysis 
(Figure 3D). We conclude that EBER2 is required for PAX5 local- 
ization (Figure 2G), but PAX5 is dispensable for EBER2 recruit- 
ment to the TRs. 

Since EBER2 appears to be the key recruiting entity of the 
EBER2-PAX5 RNP, we considered the possibility of EBER2 
recruitment via an RNA-RNA interaction. The fact that EBER2 
binding to the TRs as assessed by CHART was not affected 
when RNase H digestion preceded ASO selection argues 
against an RNA-DNA interaction (Figures S3A and S3B). On 



Figure 3. PAX5 Is Dispensable for EBER2 
Recruitment to the TRs 

(A) Knockdown efficiency of PAX5 by lentivirally 
expressed shRNA was determined by western blot 
using anti-PAX5 antibody. Anti-Nucleolin antibody 
provided a loading control. 

(B) qRT-PCR analysis after PAX5 knockdown. 

(C) PAX5 ChIP-qPCR analysis after PAX5 knock- 
down. *p = 0.03, **p = 0.002 (Student’s t test; n = 3). 

(D) EBER2-CHART followed by qPCR analysis af- 
ter PAX5 knockdown. All data represent the mean 
of three independent experiments ± SD. 

See also Table S1.




the other hand, a region within EBER2 
that base pairs with an RNA transcribed 
from the TR region, such as the nascent 
transcripts of the two LMP2 isoforms, 
which contain the TRs in their first intron,
might exist (Figure S4B). If this RNA- 
RNA-mediated recruitment model were 
correct, we should be able to (1) identify 
the complementarity, (2) show that tran- 
scription through the TRs is required for EBER2 recruitment, 
and (3) obtain evidence for a physical interaction between 
EBER2 and the nascent TR sequence-containing transcript 
in vivo. 

Potential base pairing between EBER2 and the EBV type II TR 
RNA sequence was examined using the RNAup program to 
search for short, stable RNA-RNA interactions (Muckstein 
et al., 2006). A putative 18-bp hybrid was identified with a predicted
free energy of ΔG = -28.10 kcal/mol. This hybrid comprises
EBV TR nucleotides 177-200 and EBER2 nucleotides
41-64 (Figure 4A). Intriguingly, this sequence in EBER2 coincides
with one of two oligonucleotide-accessible sites identified
by RNase H digestion (Figure 1B). Notably, the sequence within
the TR transcript (nucleotides 177-200) predicted to base pair 
with EBER2 lies adjacent to one of the PAX5 consensus sites (nu- 
cleotides 200-214; Figure 4A). A second putative 17-bp RNA- 
RNA hybrid of comparable predicted stability was identified 
that also overlaps with the same oligonucleotide-accessible re- 
gion within EBER2 (site A versus site B of EBV type II; Figures 
S5B-S5D). 
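
To make the idea of searching for complementarity concrete, the sketch below scans two RNAs for their longest gap-free antiparallel duplex, allowing Watson-Crick and G-U wobble pairs. It is a deliberately simplified stand-in and not the RNAup algorithm: it computes no free energies and ignores intramolecular structure and accessibility. The function name and the toy sequences are hypothetical.

```python
ALLOWED_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"),
                 ("C", "G"), ("G", "U"), ("U", "G")}


def longest_duplex(guide, target):
    """Longest gap-free antiparallel duplex between two RNAs given 5'->3'.

    Returns (length, guide_start, target_start), where both starts are the
    1-based 5'-most paired position in each sequence; (0, 0, 0) if no pair.
    """
    guide = guide.upper()
    target = target.upper()
    rev_target = target[::-1]  # read target 3'->5' so paired bases align by index
    best = (0, 0, 0)
    prev = [0] * (len(rev_target) + 1)
    for i, g in enumerate(guide, start=1):
        cur = [0] * (len(rev_target) + 1)
        for j, t in enumerate(rev_target, start=1):
            if (g, t) in ALLOWED_PAIRS:
                cur[j] = prev[j - 1] + 1  # extend the paired run on this diagonal
                if cur[j] > best[0]:
                    length = cur[j]
                    best = (length, i - length + 1, len(target) - j + 1)
        prev = cur
    return best


# Toy example: guide positions 4-7 (ACGC) pair with target positions 5-8 (GCGU).
toy_hit = longest_duplex("AAAACGCAAAA", "CCCCGCGUCCCC")
```

A thermodynamic tool is still needed to rank candidate hybrids, because stretches of equal length can differ widely in stability depending on GC content and on the cost of opening local structure in either RNA.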

To show that the accessible region in EBER2 (nucleotides 41- 
64) is indeed necessary for its recruitment to TRs, we used an 
antisense morpholino oligonucleotide (AMO) that anneals to 
this region (AMO EBER2-1; Table S1; Figure 5A, left panel, nucleotides
in blue) to block the putative EBER2-TR base-pairing interaction.
We analyzed mRNA levels for several EBV-latent genes
upon nucleofection of the 25-nt AMO EBER2-1 and observed 
that LMP2A, and to a lesser extent LMP1 and LMP2B, transcripts 
increased in level compared to a scrambled control AMO (AMO 
CTRL) and an AMO targeting EBER2 nucleotides 146-170 
(AMO EBER2-2; Table S1), which are not predicted to form
RNA-RNA interactions (Figures 4B and S3C). Moreover, AMO 
EBER2-1 treatment reduced PAX5 localization specifically at 
the TRs as measured by ChIP (Figure 4C). We were unable to 
use an AMO against the TR nucleotides 177-200 to block base 
Figure 4. EBER2 Is Recruited to the TRs through Base Pairing with Nascent RNA from the TR Locus

(A) Predicted RNA-RNA interaction between EBER2 and a region within the TR (bottom). TR coordinates are also shown for the two PAX5 consensus sites (top). 

(B) RNA levels of several EBV genes were measured by qRT-PCR after 3 days of treatment with EBER2 AMO complementary to nucleotides 35-59 (AMO EBER2- 
1) or nucleotides 146-170 (AMO EBER2-2). 

(C) Quantification of the PAX5 ChIP at the TR, Cp, and CD79a loci after treatment with AMO EBER2-1 or EBER2-2. The control IgG ChIP data (data not shown) 
were comparable to those in Figure 2G. *p = 0.03 (Student’s t test, n = 3). 



Figure 5. EBER2-Guided Recruitment of PAX5
to the TRs Appears to Be Evolutionarily 
Conserved in a Related Gamma-Herpesvirus 

(A) Secondary structure models of EBV EBER2 and 
the EBER2 homolog of rhesus LCV (CeHV15). Nucleotides
predicted to base pair with TR RNA are
indicated in blue. 

(B) Relative position and coordinates within the TR, 
as well as the PAX5 consensus site, are shown (top). 
The predicted RNA-RNA interaction between 
CeHV15 EBER2 and the TR RNA is shown at the
bottom. 

(C) Model for complementary base-pairing-mediated 
recruitment of EBER2-PAX5 RNP to the TR region. 
See also Figure S3. 
pairing of the nascent transcript with EBER2, as the nucleotide
sequence of this AMO exhibits strong self-complementarity 
(data not shown). In summary, blocking the putative base-pairing 
region of EBER2 with an AMO results in the same phenotype 
as EBER2 depletion (Figures 2E and 2G). Importantly, CHART 
confirmed decreased EBER2 binding to the TRs in the presence 
of the AMO EBER2-1 that targets the predicted RNA-RNA inter- 
action site (Figure 4D). These results argue that base pairing of 
EBER2 to the nascent LMP2A/B transcript could be instrumental 
in recruiting EBER2 and PAX5 to the TRs. 

To demonstrate that transcription through the TRs is necessary 
for EBER2 recruitment, we interfered with LMP2 gene expres- 
sion, which generates nascent transcripts containing TR se- 
quences within the first intron (Figure 5C). We used CRISPR- 
mediated transcriptional interference by the catalytically inactive 
dCas9 protein to specifically silence both LMP2 isoforms (Gilbert 
et al., 2013). We generated stable dCas9-KRAB-expressing 
BJAB-B1 cells and identified potent single-guide RNAs (sgRNAs) 



against LMP2A and LMP2B (Figures 4E and 
S4A-S4C). LMP1 expression was collater- 
ally silenced by this approach, probably 
because its promoter is located close to 
the LMP2B transcription start site. Inhibiting 
the expression of nascent transcripts 
containing TR sequences by silencing both 
LMP2 isoforms resulted in decreased 
EBER2 binding to the TRs as determined 
by CHART (Figure 4F). Simultaneous usage 
of sgRNAs against both LMP2 isoforms 
was necessary to observe decreased 
EBER2 binding, as inhibiting one isoform alone did not affect 
EBER2 localization to the TRs (Figure S4D; data not shown). 

Finally, to confirm in vivo the predicted base-pairing interaction 
between EBER2 and TR sequence-containing RNA, we used the 
psoralen derivative aminomethyltrioxsalen (AMT). AMT preferen- 
tially crosslinks pyrimidine bases on opposite strands of double- 
stranded RNA after irradiation with 365 nm UV light; 254 nm UV 
light irradiation reverses the crosslinks (Cimino et al., 1985). First,
we ascertained that AMT is able to crosslink the predicted inter- 
action sites between EBER2 and nascent RNA containing the TR 
sequence. We in vitro transcribed a 42-nt RNA from the TR region 
(Table S1) that contains the sequence predicted to base pair with
EBER2 and added it to an EBER2 RNP containing cell lysate. Only 
in the presence of AMT and the in-vitro-transcribed TR RNA did 
we observe a higher molecular weight band in an EBER2 North- 
ern blot (Figure 4G, lane 4, arrow). This band disappeared when 
crosslinks were reversed by irradiating with 254 nm UV light (Fig- 
ure 4G, lane 6), indicating that the predicted base pairs form 



(D) EBER2-CHART followed by qPCR analysis after AMO EBER2-1 or KD ASO treatment. 

(E) qRT-PCR analysis after expressing sgRNAs targeting LMP2A and LMP2B (Figure S4B) in dCas9-KRAB-expressing BJAB-B1 cells. 

(F) EBER2 CHART was conducted after CRISPR/dCas9-mediated transcriptional interference of LMP2 genes. **p = 0.01 (Student’s t test, n = 3). 

(G) EBER2-RNP-containing cell lysate was incubated with an in-vitro-transcribed 42-nt RNA fragment from the TR region (nucleotides 167-208) predicted to base
pair with EBER2 as shown in (A). AMT was added where indicated, and the reaction was exposed to long-wave UV light (365 nm). Short-wave UV-light irradiation 
(254 nm) was included as indicated to reverse crosslinks. RNA was isolated, and northern blotting was carried out on a denaturing urea-polyacrylamide gel, 
probing for EBER2 and the in-vitro-transcribed TR RNA. An arrow indicates EBER2 crosslinked to the TR RNA fragment. 

(H) EBV-positive cells were treated with long-wave UV light in the presence or absence of AMT. EBER1 and EBER2 were selected using specific ASOs and, 
together with co-precipitated RNAs, were reverse transcribed for qRT-PCR analysis. The abundance of TR-sequence-containing RNA was measured in EBER1-
and EBER2-selected samples. Primers detecting 18S rRNA and GAPDH mRNA were used as negative controls. All data represent the mean of three independent
experiments ± SD. 

See also Figures S3 and S4 and Table S1.
Figure 6. EBER2 Depletion Results in
Decreased Viral Lytic Replication 

(A) Viral lytic replication after KD-ASO-mediated 
EBER2 depletion was measured by Southern blot 
analysis using the Xho1.9 probe. The same blot
was probed for the cellular GAPDH locus as a 
loading control. 

(B) Experimental outline for viral lytic induction by 
sodium butyrate (NaB) in combination with EBER2 
knockdown by KD ASO. 

(C) Quantification of three independent experi- 
ments as shown in (A). 

(D) Viral lytic replication was measured by qPCR 
analysis, amplifying the EBV dyad symmetry DNA 
region normalized to the cellular actin gene. 

(E) Relative viral titer in supernatant was measured 
by qPCR normalized to spike-in control. All data 
represent the mean of three independent experi- 
ments ± SD. 

See also Figure S6 and Table S1.



in vitro. In vitro crosslinking was also observed with the second 
EBER2-TR interaction site (Figure S5E), suggesting that both 
RNA-RNA interactions could contribute to association in vivo. 
To show that base-pairing interaction does occur in vivo, we en- 
riched EBER2, as well as EBER1 as a control, using ASO beads 
under denaturing conditions from a lysate of cells after in vivo 
psoralen crosslinking. EBERs, together with crosslinked RNAs, 
were eluted from the ASO beads with tetraethylammonium chlo- 
ride-containing buffer to minimize background (Figure S5A). TR 
RNA was enriched by EBER2 compared to EBER1 ASO beads 
only after psoralen crosslinking as measured by qRT-PCR (Figure 4H).
No enrichment was observed for 18S rRNA or GAPDH
mRNA. Together, these results argue that EBER2 base pairs 
with TR sequence-containing nascent transcripts of the LMP2A 
and 2B genes in vivo. 

EBER2’s Interaction with Nascent RNA Is Evolutionarily 
Conserved 

Of the many EBV-related primate LCVs (Lacoste et al., 2010), 
complete genome sequences are available only for EBV (type I 
and type II), for the rhesus (Cercopithecine herpesvirus 15, 
CeHV15), and for the marmoset LCV (Callitrichine herpesvirus 3) 
(Rivailler et al., 2002a, 2002b). The genome of the last has apparently
lost its EBER2 gene, while CeHV15 retains an EBER2 homolog.
Even though only moderate sequence conservation (65%) is
exhibited, CeHV15 EBER2 can be modeled to fold into a structure
that is almost identical to that of EBV EBER2 (Figure 5A). The
CeHV15 TR, on the other hand, has no obvious sequence similarity
to the EBV TR sequence, except for high overall GC content
and a tandem repeat organization (Rivailler et al., 2002b); the 
repeat unit is considerably longer (933 bp) and contains only a sin- 
gle PAX5 consensus sequence (Figure 5B, top). 

Using the RNAup program, we searched for an RNA-RNA 
interaction between CeHV15 EBER2 and its TR sequence.
Only one stable hybrid was predicted with a free energy of
ΔG = -25.60 kcal/mol (Figure 5B, bottom). Strikingly, the
sequence within the CeHV15 EBER2 homolog predicted to
base pair with the TR is in the same relative location as is that 
in EBV EBER2 (Figure 5A, nucleotides in blue). Furthermore, 



despite great sequence divergence, the region within CeHV15
TR predicted to base pair with its EBER2 homolog overlaps
the PAX5 consensus site, similar to its position in the EBV TR
(compare Figure 5B and Figure 4A). Thus, the function of EBER2
in guiding PAX5 to the TRs appears to be evolutionarily conserved in
the rhesus virus and possibly other yet unsequenced LCVs.
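
The kind of search described above can be illustrated with a much simpler stand-in. The sketch below is not RNAup and computes no free energies; it only finds the longest contiguous antiparallel stretch of Watson-Crick or G-U wobble pairs between two RNA sequences. The function name `longest_duplex` and the example sequences are hypothetical and are not taken from EBER2 or the TRs.

```python
# Allowed RNA base pairs: Watson-Crick plus the G-U wobble pair.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def longest_duplex(guide: str, target: str):
    """Return (length, guide_start, target_start) of the longest contiguous
    antiparallel duplex between two RNAs (0-based indices, both read 5'->3')."""
    rev = target[::-1]  # read the target 3'->5' so pairing partners line up
    best = (0, 0, 0)
    # prev[j] holds the length of the paired run ending at guide[i-2], rev[j-1]
    prev = [0] * (len(rev) + 1)
    for i in range(1, len(guide) + 1):
        cur = [0] * (len(rev) + 1)
        for j in range(1, len(rev) + 1):
            if (guide[i - 1], rev[j - 1]) in PAIRS:
                cur[j] = prev[j - 1] + 1
                if cur[j] > best[0]:
                    length = cur[j]
                    # map the end of the run in `rev` back to target coordinates
                    best = (length, i - length, len(target) - j)
        prev = cur
    return best


# Hypothetical sequences: the target is the exact reverse complement of the guide.
print(longest_duplex("AAGGCUCAA", "UUGAGCCUU"))
```

A thermodynamic tool such as RNAup additionally weighs stacking energies and the cost of opening local structure in both RNAs, which a contiguous-pairing scan like this one ignores.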

EBER2 Knockdown Affects Viral Lytic Replication 

Given the localization of EBER2 at the TRs, which during the lytic 
cycle are the sites of viral genome linearization upon packaging 
into capsids (Zimmermann and Hammerschmidt, 1995), we 

asked whether viral lytic replication might be affected by 
EBER2 depletion. We treated the replication-permissive EBV-positive
cell line HH514-16 with sodium butyrate (NaB) to induce
lytic replication in combination with EBER2 KD ASO nucleofec- 
tion (Figures 6B and S6A) (Ragoczy et al., 1998). Genomic 
DNA was isolated and subjected to Southern blot analysis using 
the Xho1.9 probe, which covers a unique sequence adjacent to
the TR region, to quantify lytic replication (Figure S6B) (Raab- 
Traub and Flynn, 1986). After EBER2 depletion, lytic replication 
was decreased as analyzed by Southern blot and qPCR (Figures 
6A, 6C, and 6D). The viral titer in the culture medium was reduced 
to an even greater extent (Figure 6E). Depletion of PAX5 consis- 
tently resulted in comparable decreased lytic replication (Figures 
S6C-S6E), supporting the notion that EBER2 and PAX5 act in 
concert. NaB promotes the expression of Zebra, an EBV tran- 
scription activator encoded by the BZLF1 gene that acts to 
induce lytic replication (Miller et al., 2007). EBER2 knockdown 
did not affect Zebra expression (Figure S6F), ruling out the pos- 
sibility that the decreased lytic replication was due to a lower 
level of Zebra. Interestingly, the decrease in viral replication 
was restricted to lytic replication, since latent replication, 
measured in the absence of NaB induction (see Figure 2C), 
was unaffected upon EBER2 knockdown (Figure S6G). 
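
The qPCR measurements above are described only as normalized to a cellular reference gene. One common way to do such a normalization is the 2^-ΔΔCt calculation, sketched below as an assumption rather than as the analysis used here. It presumes roughly 100% amplification efficiency for both amplicons, and all Ct values in the example are hypothetical.

```python
def relative_abundance(ct_target, ct_reference, ct_target_ctrl, ct_reference_ctrl):
    """Fold change of a target amplicon in a treated sample relative to a
    control sample, each normalized to a reference amplicon (2^-ddCt)."""
    delta_sample = ct_target - ct_reference             # dCt, treated sample
    delta_control = ct_target_ctrl - ct_reference_ctrl  # dCt, control sample
    return 2.0 ** -(delta_sample - delta_control)


# Hypothetical Ct values: the viral amplicon appears two cycles later after
# knockdown while the reference amplicon is unchanged.
print(relative_abundance(22.0, 18.0, 20.0, 18.0))
```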

DISCUSSION 

Using CHART, we identified EBER2 as localized to the TRs of 
the latent EBV genome. We were unable to examine EBER1 
localization by CHART because no region in EBER1 is accessible
for hybridization with an ASO (Lee et al., 2012). A potential func- 
tion of EBER2 at the TRs was suggested by its chromatin co- 
localization and interaction with the B cell master regulator 
PAX5. Even though two strong PAX5 consensus sequences are 
present in each TR unit, recruitment of the EBER2-PAX5 RNP 
does not appear to be achieved by direct PAX5 binding to DNA 
but is greatly facilitated by nucleotide sequence information 
contributed by EBER2 (Figure 5C). Perhaps because of the 
degenerate nature of the PAX5 consensus sequence (Fig- 
ure S2D), the EBER2 RNP is required for efficient recruitment to 
the DNA target site (Figure 2). Alternatively, the EBER2 RNP 
may be required to clear adjacent chromatin of interfering factors
or to stabilize PAX5-DNA binding. We identified a base-pairing
interaction between EBER2 and nascent transcripts
from the TR regions that could provide such enhanced targeting 
specificity (Figure 4A). Indeed, the process of transcription 
through the TR regions per se might be integral to the recruitment 
mechanism by opening the chromatin conformation and thus 
facilitating accessibility of the PAX5-EBER2 RNP to its target site. 

We identified PAX5 as a novel EBER2-interacting protein, 
prompted by the observation that this transcription factor and 
the viral noncoding RNA co-localize at the TRs (Figure 1). The 
interaction appears to be indirect, based on the negative results 
of EMSAs and UV crosslinking experiments (Figures S2C-S2G). 
We are currently attempting to further analyze the EBER2-PAX5 
RNP, particularly focusing on the factor that bridges EBER2 and 
PAX5. Depletion of either EBER2 or PAX5 exhibits overlapping 
phenotypes, such as the upregulation of LMP genes (Figure 2E), 
suggesting both a functional and a physical interaction of the two
factors. PAX5 depletion was reported to have a broader effect on 
EBV-latent gene expression (Arvey et al., 2012) than we observe 
here, which might be explained by the different time points and 
methods used. While Arvey et al. (2012) achieved PAX5 knock- 
down by lentiviral expression of shRNAs and gene expression 
changes were examined after 5 days, we chose an earlier time 
point (3 days) for analyzing gene expression upon EBER2 deple- 
tion by KD ASOs in an effort to exclude secondary and pleio- 
tropic effects of long-term PAX5 knockdown. 

Our attempts to recapitulate EBER2-PAX5-guided transcrip- 
tional silencing at EBV TRs in heterologous reporter systems 
were unsuccessful. When the entire LMP2A locus, including 
the TRs in its first intron, was cloned into a vector and expressed 
together with EBER2 and PAX5 in B lymphocytes, no effect on 
LMP2A expression was observed (Figure S3D). Similarly, a
luciferase reporter gene with TRs incorporated into its 3' end was
also unresponsive to the presence of EBER2 and PAX5 (Figure S3E).
Apparently, a nascent transcript from the TRs alone
is not sufficient for PAX5 recruitment through EBER2, and other 
factors, possibly DNA elements, are necessary to establish a 
silenced chromatin architecture. 

PAX5 at the TRs appears to exert a function other than acting 
as a classical transcription activator or a repressor, as is normally 
found at promoters. The repeat organization of the TRs might be 
a crucial aspect of the PAX5 localization mechanism and 
perhaps also of function. Indeed, the related transcription fac- 
tors, Pax3 and Pax9, have been reported to restrict RNA output 
from mouse satellite repeat sequences by binding and recruiting 



histone methyl transferases to silence repetitive DNA (Bulut-Kar- 
slioglu et al., 2012). In another uninfected cellular context, Pax5 
action has been reported to regulate the immunoglobulin heavy 
chain (Igh) locus during VDJ recombination in pro-B cells (Ebert 
et al., 2011). Parallels include (1) repeat regions bound by Pax5, 
the so-called Pax5-activated intergenic repeat elements (PAIRs), 
of which 14 are interspersed in the Igh locus, and (2) strikingly, 
the existence of a noncoding RNA expressed from the Igh locus. 
Expression of the RNA coincides with Pax5 binding to the PAIRs,
whereas at later times in B cell development when the RNA 
ceases to be made, Pax5 localization is no longer detected. It 
is tempting to speculate that the Igh noncoding transcript might 
contribute to Pax5 recruitment similarly to the nascent transcript 
emanating from the TR regions of EBV. Furthermore, Pax5 in- 
duces chromatin condensation of the Igh locus (Fuxa et al., 
2004). If an analogous chromatin contraction occurs at the TRs 
of the EBV genome, transcriptional upregulation of LMP genes 
following perturbation of the EBER2-PAX5-mediated control 
mechanism might be explained by a looser chromatin conforma- 
tion that facilitates transcription through the region. 

Consistent with the possibility that genome organization is 
regulated by the EBER2-PAX5 interaction, EBER2 depletion 
does not result in immediate transcriptional upregulation of 
EBV genes nearest to its binding site; changes become apparent 
only after 3 days of knockdown. This observation suggests that 
the genome organization at the TRs, once established, remains 
stable unless the correct organization cannot be resumed 
following genome replication and/or dilution of regulating factors 
by knockdown. Importantly, we demonstrate that EBER2 deple- 
tion affects viral lytic replication and propose that decreased lytic 
replication might be a consequence of improper genome organi- 
zation that hinders efficient replication. Perhaps, latent replica- 
tion is not affected because the TRs contribute differently to 
the replication of EBV episomes, compared to the production 
of linear packaged virion DNA. It is of course possible that there 
are additional consequences of EBER2 knockdown that we have 
not assessed in this study. One such possibility, given the fact 
that both EBER2 and lytic replication have been implicated in 
promoting oncogenicity (Katsumura et al., 2012; Ma et al., 
2011), is an interplay between the function of EBER2 at the 
TRs and the consequences of lytic replication on tumor forma- 
tion. An indication that EBER2 could have function(s) in addition 
to recruiting PAX5 to the TRs stems from the fact that more 
EBER2 molecules are present than complementary TR binding 
sites; in an infected cell, there are 2.5 x 10^5 EBER2 molecules
and up to 50 EBV episomes, each containing up to 20 TRs, 
although each TR probably harbors multiple nascent transcripts. 
On the other hand, the overabundance of EBER2 molecules 
compared to the number of TR binding sites during latency could 
be necessary to accommodate the massive increase in EBV 
genome copy number, and hence TR binding sites, occurring 
during viral lytic replication. 
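
The excess of EBER2 over TR sites can be made concrete with the figures quoted above (2.5 x 10^5 EBER2 molecules, up to 50 episomes, up to 20 TRs each). The short calculation below is only a sketch: the `transcripts_per_tr` parameter is a hypothetical knob for the unquantified number of nascent transcripts per TR, not a measured value.

```python
def eber2_per_binding_site(eber2_molecules, episomes, trs_per_episome,
                           transcripts_per_tr=1):
    """Ratio of EBER2 molecules to potential TR binding sites in one cell."""
    sites = episomes * trs_per_episome * transcripts_per_tr
    return eber2_molecules / sites


# Upper-bound site count from the text: 50 episomes x 20 TRs = 1,000 TR units.
print(eber2_per_binding_site(2.5e5, 50, 20))

# Hypothetical case with five nascent transcripts per TR.
print(eber2_per_binding_site(2.5e5, 50, 20, transcripts_per_tr=5))
```

Even at the maximal episome and TR counts, EBER2 stays in large excess unless each TR carries on the order of hundreds of nascent transcripts.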

In recent years, more and more long noncoding RNAs 
(lncRNAs), arbitrarily defined as >200 nucleotides in size, have
been shown to fulfill a diversity of cellular functions (Cech and
Steitz, 2014). In addition to post-transcriptional regulation, a
common theme is the interaction of lncRNAs with chromatin-modulating
factors to control gene expression (Huarte et al.,
2010; Nagano et al., 2008; Tsai et al., 2010; Zhao et al., 2008).
Thus, lncRNAs have been proposed in theory to act as targeting
guides for effector proteins by base pairing with specific chro- 
matin sites via RNA-RNA interactions, RNA-DNA interactions 
(Gilbert et al., 2013), or triplex formation (Schmitz et al., 2010). 
RNA-RNA interactions are the best-supported interaction 
mode experimentally for noncoding RNAs smaller than 200 nu- 
cleotides, as exemplified by small nucleolar RNAs acting in 
RNA modification, small nuclear RNAs in pre-mRNA splicing, 
or microRNAs in targeting mRNAs (Kim et al., 2009; Watkins 
and Bohnsack, 2012; Will and Lührmann, 2011). A guide function
for targeting specific sites on chromatin thus far has been 
ascribed to tiny RNAs only, such as piwi-interacting RNA-medi- 
ated recruitment of PIWI in Drosophila, small-interfering-RNA 
(siRNA)-mediated centromeric silencing in yeast, and siRNA- 
directed DNA methylation in plants (Lejeune et al., 2010; Malone 
and Hannon, 2009; Matzke et al., 2007). All previously reported 
lncRNAs appear to fulfill an architectural scaffolding function,
often with chromatin-regulating proteins. Here, for the first 
time, we provide evidence for a base-pairing interaction of a 
trans-acting moderately sized noncoding RNA, EBER2, which
facilitates the recruitment of an associated transcription factor 
to chromatin target sites. 

The ability of EBER2 to help recruit PAX5 to the TRs appears 
to be evolutionarily conserved not only in both type I and type 
II EBV (Figures S5B-S5E) but also in a related rhesus LCV. 
Marmoset LCV, a virus that infects a new world primate, has 
lost an EBER2 homolog but nonetheless retains a strong PAX5 
consensus site within its TR (Rivailler et al., 2002a), suggesting 
that PAX5 binding occurs. This raises the question of whether 
PAX5 is recruited by an analogous mechanism involving a yet 
unidentified noncoding RNA, which has replaced EBER2, or 
whether a compensatory mechanism not involving an RNA- 
RNA interaction mediates recruitment of PAX5 in marmoset 
LCV. A better understanding of the precise EBER2-PAX5 RNP 
composition will be essential to distinguish between the two 
possibilities. Another open question is whether EBER2 is re- 
cruited to TRs in EBV-infected cells that do not express the B- 
cell-specific factor PAX5, such as nasopharyngeal carcinoma 
(NPC) cells, which are epithelial. As NPCs exhibit latency II 
(Rowe et al., 2009), characterized by robust expression of 
LMP1 and LMP2 genes, as well as EBER2, EBER2-PAX5-medi- 
ated transcription inhibition is unlikely to occur in these cells. 
Similarly, the questions of whether and how PAX5 is recruited 
to the TRs in EBV strains carrying a deletion of the EBER2 
gene also remain to be answered. 

EXPERIMENTAL PROCEDURES 
CHART Assay 

Endogenous RNase H cleavage assays were performed as described in Lee 
et al. (2012). CHART was carried out as described in Simon (2013), with minor 
modifications. CHART-seq data were deposited in the Sequence Read 
Archive under the accession number SRR1640963. For detailed protocols,
see the Extended Experimental Procedures.

EBER2 and PAX5 Knockdown 

2.5 x 10^6 BJAB-B1 cells were nucleofected with 10 µl of 100 mM KD ASO/
AMO stock solution in SF solution with program EN-150 using the Lonza 



4D-Nucleofector System. HH514-16 cells were nucleofected using the Lonza 
2b Device with solution V and program A-023. On the day following nucleofec- 
tion, cells were separated from debris using Lymphocyte Separation Medium 
(Corning Cellgro) in accordance with the manufacturer’s instructions. For KD 
ASO and AMO sequence information, see Table S1. KD ASOs were designed
(Ideue et al., 2009) with DNA nucleotides flanked by 2'-O-methyl nucleotides
and with a phosphorothioate backbone (for increased stability) to
induce cleavage by endogenous RNase H. 

RNAi against PAX5 was performed by lentiviral shRNA expression using 
MISSION shRNA clones TRC0000016061 and TRC0000016062 (SIGMA) as
described previously (Arvey et al., 2012; Cozma et al., 2007). Lentiviruses 
were produced as described previously (Lee et al., 2012). Cells were cultured 
under puromycin selection 1 day after infection and harvested 4 days post- 
infection, as a decrease in proliferation rate became apparent at this time 
point. 

Psoralen Crosslinking of RNAs 

A 42-nt or 36-nt RNA within the TR region (nucleotides 167-208 and 74-109 of
EBV type II, respectively) predicted to base pair with EBER2 was in vitro
transcribed with T7 polymerase from an oligonucleotide template (see Table S1 for
sequence). For psoralen crosslinking of EBER2 RNP to the in-vitro-transcribed
TR fragment, nuclei were isolated by lysing 10^7 BJAB-B1 cells in 10 mM Tris
(pH 8.0), 0.32 M sucrose, 3 mM CaCl2, 0.1 mM EDTA, and 0.1% NP-40 and
were then resuspended in 100 µl of 10 mM HEPES (pH 7.4), 150 mM KCl,
5 mM MgCl2, 0.2 mM DTT, 10% glycerol, and 0.5% NP-40 to generate nuclear
extract. The lysate was cleared by a 3-min centrifugation step at full speed in a
table-top centrifuge. 200 ng of in-vitro-transcribed TR fragment was added to
10 µl of nuclear extract in the presence of 40 µg/ml aminomethyltrioxsalen
(AMT), 0.2 µg tRNA, and 40 U RNase inhibitor and incubated 30 min at room
temperature (RT) before the reaction was irradiated for 30 min on ice covered 
with a 2-mm-thick glass plate from a distance of 2.5 cm with a handheld 365- 
nm UV lamp. Extracts were treated with 254-nm UV irradiation for 10 min on 
ice, where applicable. After crosslinking, RNA was isolated with TRIZOL and 
subjected to Northern blot analysis. 

For in vivo crosslinking of intact cells, 2 x 10^7 cells were resuspended in 1 ml
of growth medium containing 50 µg/ml AMT, incubated for 5 min at 37°C
before chilling the cells on ice, and irradiated with UV light as described above.
Cells were washed with PBS, and RNA was isolated with TRIZOL and DNase
treated. 20 µg total RNA was heated in 100 µl TE buffer at 95°C for 3 min and
chilled on ice before 50 µl Denaturant buffer and 150 µl 2x Hybridization buffer
were added (see CHART protocol above). 25 µl of biotinylated (EBER1 or
EBER2) ASO-streptavidin Dynabeads were added and incubated overnight
at RT. Beads were washed three times with CHART wash buffer and once
with 2.4 M tetraethylammonium chloride (TEACl) at 25°C, and bound RNAs
were eluted with 2.4 M TEACl for 5 min at 40°C, followed by phenol-chloroform
extraction prior to Northern blot or qRT-PCR analyses. 

ACCESSION NUMBERS 

The SRA accession number for the CHART-seq reported in this paper is 
SRR1640963.

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, six 
figures, and one table and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2015.01.015.
SUMMARY

A central paradigm within virology is that each viral 
particle largely behaves as an independent infectious 
unit. Here, we demonstrate that clusters of entero- 
viral particles are packaged within phosphatidylser- 
ine (PS) lipid-enriched vesicles that are non-lytically 
released from cells and provide greater infection effi- 
ciency than free single viral particles. We show that 
vesicular PS lipids are co-factors to the relevant 
enterovirus receptors in mediating subsequent infec- 
tivity and transmission, in particular to primary hu- 
man macrophages. We demonstrate that clustered 
packaging of viral particles within vesicles enables 
multiple viral RNA genomes to be collectively trans- 
ferred into single cells. This study reveals a novel 
mode of viral transmission, where enteroviral ge- 
nomes are transmitted from cell-to-cell en bloc 
in membrane-bound PS vesicles instead of as single 
independent genomes. This has implications for 
facilitating genetic cooperativity among viral quasis- 
pecies as well as enhancing viral replication. 

INTRODUCTION 

Enteroviruses are a large genus of single positive-stranded RNA 
viruses whose members, including poliovirus (PV), Coxsackievirus,
rhinovirus, and enterovirus 68, are the causative agents
of a number of important and widespread human diseases 
including poliomyelitis, myocarditis, hand foot and mouth dis- 
ease, the common cold, and more recently, a severe respiratory 
disease with paralytic symptoms. In addition to >70 enteroviral 
serotypes identified in humans, enteroviral quasispecies are 
common largely as a result of inherent error making and lack of 
proofreading mechanisms of viral RNA-dependent RNA poly- 
merases (RdRp). 

Enteroviral RNA genomes serve as templates for both transla- 
tion and replication, and these processes take place on host 
intracellular membranes (den Boon and Ahlquist, 2010; Hsu
et al., 2010). After enteroviruses have bound their specific host
receptors either at the cell surface or within endocytic vesicles
(Brandenburg et al., 2007), the capsid undergoes a conformational
change that allows the viral RNA to be transferred
across the plasma membrane and/or endosomal membrane
into the cytoplasm through a mechanism that is not yet completely defined
(Strauss et al., 2013). In the cytoplasm, the enteroviral RNA is first
translated into non-structural proteins and structural proteins, 
where the former makes up the RNA genome replication machin- 
ery and the latter the capsid. The viral RNA replication machinery 
is then assembled on the cytoplasmic membrane leaflet of ER-derived
membranes that are subsequently modified by viral and
host proteins to have a specific lipid blueprint of enrichment in 
phosphatidylinositol-4-phosphate (PI4P) and cholesterol lipids. 
These lipids regulate the membrane association, assembly, 
and activity of the viral replication protein complex, including 
the RdRp, and thus facilitate viral RNA synthesis (Hsu et al., 
2010; Ilnytska et al., 2013; Nchoutmboube et al., 2013).

Once the enteroviral RNA is synthesized, little is known about 
where in the host cell it is packaged in capsids and how these 
capsids are released from cells. While enteroviruses have his- 
torically been considered non-enveloped (i.e., lacking a host- 
derived membrane bilayer around their capsids) and thus rely 
on cell lysis to exit, recent reports of extracellular Coxsackievirus 
B3 (CVB3) being present in vesicles (Robinson et al., 2014) and 
PV being able to spread non-lytically among host cells (Bird 
et al., 2014) have raised important questions regarding the extracellular
nature of enteroviral particles and the significance of
non-lytic exit in the viral life cycle. Moreover, hepatitis A, hepatitis
E, and bluetongue viral particles, all long considered non-enveloped,
have been observed surrounded by membranes (Feng
et al., 2013; Takahashi et al., 2008; Owens et al., 2004).

A central paradigm in virology is that viruses behave as inde- 
pendent infectious units (Flint et al., 2009). While there are ex- 
ceptions to this, such as vaccinia virus particles preventing 
superinfection by inducing the host cell to repel other virions 
(Doceul et al., 2010), it is largely accepted that the fates of individual
viral genomes are not dependent on one another during exit
from one cell and entry into another (Brandenburg and Zhuang,
2007). Here, we investigate the assembly, exit, and subsequent
infection processes of enteroviral particles using a combination
of imaging techniques including confocal microscopy, super-resolution
light microscopy, and correlative light and electron microscopy,
along with single molecule RNA fluorescence in situ
hybridization (FISH), proteomic, and biochemical approaches. 
We show that multiple infectious enteroviral particles are clus- 
tered within individual phosphatidylserine (PS) lipid-enriched 
vesicles and non-lytically secreted out of cells. These viral parti- 
cles in vesicles are more efficient in establishing infection than 
free viral particles. We demonstrate that vesicles encapsulate 
and traffic large numbers of mature infectious viral particles be- 
tween cells and consequently enable the transfer of multiple viral 
RNA genomes collectively into new host cells by a mechanism 
that is dependent on both the virus-specific receptor of the recip- 
ient host cell as well as the vesicular PS lipids. 

RESULTS 

Assembled Poliovirus Capsids Are Localized to Viral 
RNA Replication Organelles 

We first investigated the intracellular spatio-temporal dynamics 
of newly synthesized PV particles. The generation of PV parti- 
cles, as well as enterovirus assembly in general, comprises a 
multistep process where capsid subunits (VP0, VP1, VP3) form
pentamers, which polymerize into capsids (Liu et al., 2010).
Once RNA is packaged, the VP0 subunits get cleaved into VP2
and VP4 to generate mature infectious virions (Liu et al., 2010). 
From screening a large number of neutralizing antibodies, we
identified the A12 antibody, which binds deep inside the canyon
bridging both rims of two adjacent pentamers and thus recognizes
assembled capsids (Chen et al., 2011, 2013), to visualize
PV capsids within infected cells. We fixed cells at various intervals
after PV infection and co-immunolabeled with A12, anti-viral
VP1 to detect individual VP1 subunits, and anti-viral 3AB antibodies,
the latter to detect viral 3AB to localize replication organelles
where viral RNA is synthesized (Hsu et al., 2010). Newly
assembled capsids were clearly detectable from 3-4 hr post- 
infection (p.i.) and onward, and they were localized to the replica- 
tion organelles (Figure 1A). Note that in contrast to A12 labeling, 
VP1 was localized to both the replication organelles and 
dispersed across the cytoplasm, consistent with its cytosolic na- 
ture. At 6-7 hr p.i., capsids were dispersed from the replication 
sites to the cytoplasm and sequestered in puncta (Figure 1A, 
6-7 hr p.i. inset). At this time, there is a known cessation in 
viral RNA synthesis (Ehrenfeld et al., 1970), and the timing of 
capsid release from replication organelles was coupled to viral 
RNA synthesis since prematurely inhibiting viral RNA synthesis 
with the inhibitor Guanidine HCl (Barton and Flanegan, 1997)
triggered capsids to rapidly disperse into the cytoplasm
(Figure S1A).

Poliovirus Capsids Are Captured by Phosphatidylserine 
Lipid-Enriched Autophagosome-like Organelles and 
Released Non-Lytically from Cells 

Between 6 and 7 hr p.i., we found that >85% of capsids (n = 10 
cells) were on punctate cytoplasmic structures that colocalized 
with the autophagosomal membrane protein, LC3-II (Figure 1B).



By transmission electron microscopy, we observed numerous 
double-membraned autophagosome-like organelles containing 
capsids (Figure 1C). Previous reports (Taylor et al., 2009; Kirke- 
gaard and Jackson, 2005; Jackson et al., 2005) had found that 
perturbation of the host autophagy pathway led to a decrease 
in PV release from infected cells. Consistent with that, either dis- 
rupting autophagy by small interfering RNA (siRNA) depletion of 
autophagy machinery LC3 or beclin 1, or acutely stimulating 
autophagy by treating cells with tat-beclin 1 peptides, blocked 
or enhanced PV release by ~10-fold, respectively (Figures S1B
and S1C), while replication was unaffected (Figure S1D). However,
these capsid-containing autophagosome-like organelles
did not follow the conventional autophagy pathway and fuse 
their contents with lysosomes as inhibiting lysosomal enzymes 
did not further increase LC3-II levels beyond the 4-fold increase 
observed in PV-infected cells (Figures S1E and S1F) and none 
of the A12/LC3-II co-labeled structures contained lysosomal 
enzymes at any point during infection (Figures S1F and S1G). 
Notably, the SNARE protein syntaxin 17, normally localized to 
autophagosomes and required for fusion with lysosomes (Itakura
et al., 2012), did not localize to these A12/LC3-II co-labeled
structures (Figure S1I).

However, we found that the membranes of both the replication 
organelles and the autophagosome-like organelles contained 
negatively charged phosphatidylserine (PS) lipids. PS lipids in 
uninfected cells are primarily located at the cytoplasmic leaflets 
of the plasma and endosomal membranes as well as at the 
lumenal leaflet of the ER membrane (Leventis and Grinstein, 
2010; Kay et al., 2012). Cells were co-transfected with GFP-LactC2
and FAPP1-mRFP, cytosolic live-cell reporters for PS
(Kay et al., 2012) and PI4P lipids, respectively, the latter to report
on the location of the ER-derived replication organelle membranes
(Hsu et al., 2010). At 4 hr p.i., cells were imaged live by
structured illumination microscopy. Highly localized PS-rich
membrane domains exposed to the cytoplasm were found
distributed across the replication organelles (Figure 1D, inset).
Later, between 6 and 7 hr p.i., cells were fixed and immunolabeled
with anti-GFP and A12 antibodies and imaged by confocal
microscopy. Numerous A12-positive structures were also found
co-labeled with GFP-LactC2, indicating the presence of PS lipids
on their membrane leaflets exposed to the cytoplasm (Figure 1E,
arrows). Consistent with this, in PV-infected live cells
co-expressing LC3-mRFP and GFP-LactC2, >90% (organelles
measured across ten cells) of the LC3-mRFP-labeled autophagosome-like
organelles were co-labeled with GFP-LactC2
(Figure 1F, inset).

We then investigated the fate of these capsid-containing or- 
ganelles during the rest of the infection time period. Between 
7 hr and 8 hr p.i., there was a 70% ± 10% (n = 15) decrease in 
the number of capsids within the cytoplasm (Figure 1G). Using 
the cell impermeable Trypan blue dye, we found that the plasma 
membrane remained intact during this time while there was an 
~6-fold increase in extracellular viral titers (Figure 1H). This 
lack of plasma membrane permeability during PV infection had 
also been previously observed (Taylor et al., 2009; Bird et al., 
2014). Although by 12 hr p.i. the cells eventually lysed (not 
shown), these data indicated that the majority of PV particles
were released prior to cell lysis. 
Figure 1. Poliovirus Capsids Are Captured by Phosphatidylserine Lipid-Enriched Autophagosome-like Organelles and Released Non-Lytically from Cells

(A) Capsids undergo dynamic spatial transitions during infection. HeLa cells were infected with PV and immunolabeled with A12, anti-VP1, and anti-3AB antibodies.
Scale bars represent 5 µm.

(B) Capsids (A12) colocalized with autophagosome marker LC3-II. PV-infected HeLa cells were immunolabeled with A12 and anti-LC3-II antibodies. Scale bar
represents 5 µm.

(C) Electron micrographs of PV-infected cells show PV capsids in double-membrane autophagosome-like organelles. Scale bars represent 5 µm and 200 nm (inset).

(D) PV-infected cells at 4 hr p.i. expressing GFP-LactC2 and FAPP1-mRFP imaged by structured illumination microscopy. Region of interest is magnified in right
panel. Scale bar represents 5 µm.

(E) PV-infected cells at 7 hr p.i. expressing GFP-LactC2 were immunolabeled with anti-GFP and A12 antibodies. Region of interest is magnified in right panel.
Arrows indicate A12-positive autophagosome-like organelles co-labeled with GFP-LactC2. Scale bar represents 5 µm.

(F) Cells co-expressing GFP-LactC2 and LC3-RFP were infected with PV and imaged by confocal microscopy at 7 hr p.i. Region of interest is magnified in right
panel. Scale bar represents 5 µm.

(G) Capsid distribution between 7 and 8 hr p.i. PV-infected cells were immunolabeled with A12, anti-VP1, and anti-3AB antibodies. Scale bar represents 5 µm.

(H) Plasma membrane integrity remains intact when PV exits cells. Trypan Blue diffusion across the plasma membrane was measured concurrently with
measurements of extracellular PV titer, the latter plotted in plaque-forming units/ml (pfu/ml).

See also Figure S1.



Extracellular PV Particles Are Found in Uniformly Large 
Vesicles 

Scanning electron microscopy (SEM) of cells at 7 hr p.i. revealed
numerous vesicular structures of similar size docked at



the extracellular side of the intact plasma membrane (Figure 2A 
and inset). While these SEM images may not reflect the true 
shape and size of these vesicles within live cells, they do point 
to a striking uniformity in size. Measurements of the cross-section 
diameter from 100 randomly selected extracellular vesicles from 
four different cells showed that ~90% of the vesicles were between 
250 nm and 350 nm in diameter (Figure 2B). Correlative 
fluorescence imaging in conjunction with SEM confirmed that 
these vesicles contained A12-labeled capsids (Figure 2C). 

Mature Enteroviral Particles Are Released in 
PS-Enriched Vesicles 

We next investigated whether the extracellular vesicles contain- 
ing PV retained the PS lipids that had been components of the 
autophagosome-like organelles (Figures 1D and 1F). Incubation 
of PV-infected cells with Alexa 568-coupled Annexin V, a non-cell 
permeable fluorescent reporter protein for PS lipids (Koopman 
et al., 1994), revealed numerous fluorescent puncta dotting the 
surface of the cell at 7 hr p.i. (Figure 3A). Similar results were 
also observed with CVB3 (at 7 hr p.i.) and human rhinovirus infections 
(at 12 hr p.i.) (Figure 3A). Note that this pattern of Annexin V 
labeling of enterovirus-infected cells was different from both 
apoptotic- and mock-infected cells: in the former, the entire 
plasma membrane was labeled with Annexin V as a result of 
PS lipids being on the extracellular membrane leaflet of the 
cell, a hallmark of apoptosis (Figure 3B, apoptosis) while in the 
latter there was no labeling since PS lipids were on the cytosolic 
leaflet of the plasma membrane (Figure 3B, mock). 

We performed time-lapse confocal/differential interference 
contrast (DIC) imaging on PV-infected cells in the presence of 
Alexa 568-Annexin V to determine if these vesicles were being 
released. We observed Annexin V-labeled vesicles emerging 
from the cell surface at discrete domains and being rapidly 
released into the extracellular medium (Figure 3C; arrow; Movie 
S1). We then quantified the amount of PS vesicles released during 
PV, CVB3, or rhinovirus infection relative to mock-infected 
cells (for each respective virus). We collected the extracellular 
medium, removed large cell debris, and enriched for vesicles 



Figure 2. Extracellular PV Particles Are 
Found in Large Uniform-Sized Vesicles 

(A) Scanning electron microscopy of a PV-infected 
cell at 7 hr p.i. Scale bar represents 3 μm. Inset 
shows higher magnification of uniform-size vesicles 
docked on the extracellular side of the plasma 
membrane. Scale bar represents 1 μm. 

(B) Extracellular vesicle size distribution in PV-infected 
cells. Cross-section diameters of 100 
randomly selected extracellular vesicles from four 
different cells were measured from scanning 
electron micrographs and plotted. Data represented 
as mean ± SEM. 

(C) Correlative fluorescence and scanning electron 
microscopy (SEM). A PV-infected cell was immunolabeled 
with A12 at 7 hr p.i., an epifluorescence 
image was obtained (right), and the sample 
was then processed for SEM (left). Arrows point to 
A12-labeled extracellular vesicles. Scale bar represents 
1 μm. 



of size range 100-500 nm using differen- 
tial centrifugation. This size range was 
chosen based on our previous light and 
electron microscopy data (Figure 2). The enrichment for vesicles 
of this size range was confirmed by transmission electron micro- 
scopy (Figure S2A). The vesicles were incubated with Alexa 
568-Annexin V and, following a wash to remove any unbound 
Annexin V, placed in a spectrofluorometer. Fluorescence measurements 
revealed a net ~9-fold, ~3-fold, and ~3-fold increase 
in amounts of PS vesicles collected from the supernatants 
of PV-, CVB3-, and rhinovirus-infected cells, respectively, 
compared to mock-infected cells (Figure 3D). 
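
The fold increases reported above amount to a ratio of fluorescence readings from infected and mock samples. A minimal sketch of that arithmetic follows; the function name, the optional blank subtraction, and the readings are hypothetical and are not the measured values behind Figure 3D.

```python
def fold_over_mock(infected_signal, mock_signal, blank=0.0):
    """Return the fold change of an infected sample relative to mock.

    Both readings are corrected by the same blank (buffer-only) value
    before the ratio is taken. Blank subtraction is an assumption of this
    sketch, not a step stated in the text.
    """
    corrected_mock = mock_signal - blank
    if corrected_mock <= 0:
        raise ValueError("mock signal must exceed the blank")
    return (infected_signal - blank) / corrected_mock


# Hypothetical spectrofluorometer readings in arbitrary units.
pv_fold = fold_over_mock(920.0, 120.0, blank=20.0)
print(f"PV vs mock: {pv_fold:.1f}-fold")
```

Normalizing each virus to its own mock control, as described in the text, keeps the comparison independent of differences in cell number or incubation time between experiments.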

We subsequently enriched for PS vesicles from the collected 
vesicle fraction using magnetic separation with Annexin V-conju- 
gated magnetic microbeads. In parallel, vesicles were also incu- 
bated with magnetic microbeads lacking Annexin V to control 
for nonspecific binding. Post-magnetic separation, the samples 
were processed for SDS-PAGE/western analysis. We found that 
the extracellular vesicles, both pre- and post-Annexin V isolation, 
had a VP2/VP0 ratio ~2-fold greater than the whole cell lysate 
(Figure 3E). This indicated that these extracellularly released PS 
vesicles were not the product of non-specific shedding of host membrane but 
rather conduits for the selective release of mature PV particles. 
Similar results were also obtained from human rhinovirus 2- 
infected cells where PS vesicles were found to contain mature 
viral particles (Figure 3F). 

Infection by PV Particles in Vesicles Is Dependent on 
Both the Virus-Specific Receptor and the PS Lipids 

We next compared the infection capability of PV particles within 
vesicles released from cells with that of free viral particles. Free 
viral particles were obtained by three cycles of quick freeze-thaw 
of the vesicle fraction. Freeze-thaw does not significantly impact 
PV infectivity (Strazynski et al., 2002), and no difference in 
infectivity was observed by plaque assay between equivalent 
numbers of PV particles isolated by freeze-thaw or collection 
from the supernatant of post-vesicle enrichment fractions (data 
Figure 3. Mature PV, CVB3, and Rhinovirus Particles Are Released in Phosphatidylserine Lipid Vesicles 

(A) Cells infected with PV, CVB3, or rhinovirus were incubated with Alexa 568-Annexin V and imaged by confocal/DIC microscopy at 7 hr p.i. Scale bar represents 
5 μm for PV and rhinovirus, 10 μm for CVB3. 

(B) Mock and apoptotic HeLa cells labeled with Alexa 568-Annexin V and imaged by confocal/DIC microscopy. Scale bar represents 10 μm. 

(C) Dynamics of Alexa 568-Annexin V-labeled PS vesicle release from plasma membrane projections of PV-infected cells at 7 hr p.i. 

(D) Quantification of PS vesicles released from enterovirus-infected cells. 

(E) VP2/VP0 ratio of whole cell lysate and PS vesicles in PV-infected cells. Annexin V-isolated PS vesicles from PV-infected cells at 7 hr p.i. were analyzed by 
SDS-PAGE/western with anti-PV VP2 antibody. 

(F) PS vesicles from human rhinovirus-infected cells contain mature rhinoviral particles. Isolated PS vesicles were processed for SDS-PAGE/western analysis 
with anti-HRV2/VP2 (neutralizing) antibodies. 

See also Figure S2 and Movie S1. 



not shown). The collected extracellular vesicles from PV-infected 
cells or free PV particles were then incubated with a confluent 
layer of HeLa cells (Figure 4A). After 4 hr of infection with either 
vesicles or free viral particles, numerous PV-replicating (based 
on immunofluorescence labeling of VP1) infected cells were 
found in both cell populations (Figure 4A). 

To determine if the PV particles within vesicles still required the 
PV receptor CD155 on the host cell for infection, cells were incubated 
with CD155 neutralizing antibodies prior to exposure to 
vesicles and viral 3AB replication protein levels were measured 
after 4 hr of infection. In the presence of neutralizing antibodies, 
vesicle infectivity was inhibited by >95%, indicating that PV vesicles 
were not simply “fusing” with cells but that the viral 
particles within the vesicles still required binding to PV receptor 
for transfer of viral RNA into the host cell cytoplasm (Figure 4B). 

We next determined if the infection was dependent on PS 
lipids of the vesicles. Vesicles isolated by differential centrifuga- 
tion were incubated with different amounts of Annexin V protein, 
which binds and masks the PS head-groups on the lipids 
(Swairjo et al., 1995). After removing any unbound Annexin V, 
the vesicles were added to HeLa cells and replication measured 
after 4 hr. Strikingly, masking of the vesicle-associated PS lipids 
by Annexin V inhibited PV infection of the host cells in a dose-dependent 
manner (Figures 4C and 4D). These data indicated 
that PS lipids are cofactors for PV infection and that mature infectious 
PV particles are predominantly in the PS vesicle fraction 
of vesicles collected by differential centrifugation. 

Infection by PV Particles in Vesicles Is More Efficient 
Than Free PV Particles 

We next measured and compared the level of infection of host 
cells when infected with equivalent numbers of PV particles either 
in vesicles or as free particles. As a measure of infection efficiency, we 
quantified and plotted the levels of viral 3AB replication proteins 
at peak replication times (Figures 4E and 4F). Due to viral RNA 
synthesis feeding back on viral RNA translation (and vice versa), 
3AB levels reflect viral RNA levels (Hsu et al., 2010). We found that 
at 4 hr p.i. of HeLa cells, viral 3AB levels were ~40% greater in 
cells incubated with PV particles in vesicles than free PV particles 
(Figure 4E). This difference was even more striking when primary 
human macrophages, cells that are specialized to recognize and 
take up PS lipid-containing cells and vesicles (Fadok et al., 1992), 
were used as the recipient host. Here, vesicle-enclosed PV particles 
were almost 2-fold greater in infection efficiency compared 
to free PV particles (Figure 4F). 

Unilamellar PS Vesicles Released from PV-Infected 
Cells Contain Multiple Viral Particles 

To quantify the clustering of viral particles within vesicles, we 
collected free and vesicle-associated PV particles and immuno- 



Figure 4. Infection by PV Particles in Vesi- 
cles Is More Efficient Than Free Viral 
Particles and Is Dependent on Both the 
Poliovirus Receptor and PS Lipids 

(A) Free or vesicle-associated PV particles were 
incubated with new recipient cells and replication 
was detected at 4 hr p.i. by immunolabeling with 
anti-VP1 antibodies. Scale bar represents 500 μm. 

(B) CD155/PVR neutralizing antibodies block 
infection by PV particles in vesicles. 

(C) Blocking PS lipids on vesicles containing PV 
particles blocks infection. Vesicles collected by 
differential centrifugation were incubated with 
different amounts of Annexin V protein prior to incubation 
with recipient HeLa cells. Replication was 
measured at 4 hr p.i. by SDS-PAGE/western 
analysis with anti-3AB antibody. 

(D) Quantification of western results in (C). 

(E) HeLa cells were incubated with equal titers of 
free and vesicle-associated PV particles. Infection 
efficiency was determined by quantifying viral 3AB 
protein levels at peak replication time (4 hr p.i.). 
Data represented as mean ± SD. 

(F) Primary human macrophages were incubated 
with equal titers of free and vesicle-associated PV 
particles. Infection efficiency was determined 
by quantifying viral 3AB protein levels at peak 
replication time (8 hr p.i.). Data represented as 
mean ± SD. 



labeled them with A12 antibodies. We 
then imaged the cover-glass-immobilized 
PV particles by total-internal reflection 
fluorescence (TIRF) and by DIC micro- 
scopy (Figure 5A). Viruses were deposited on coverslip in two 
formats: vesicle-free fraction (“Free”), or vesicle-embedded 
fraction (“Vesicular”). Imaging revealed clusters of viruses within 
vesicles, while free viruses yielded a more diffuse distribution 
(Figure 5A). We quantified the difference in distribution of 
measured fluorescence by computing the radial autocorrelation 
functions g(r) for the intensity map $I(\vec{R})$: 

$$ g(r) \propto \int I(\vec{R})\, I(\vec{R} + \vec{r})\, d\vec{R}, \qquad r = |\vec{r}| $$



The observed exponential decay of these autocorrelation 
functions enabled us to quantify the length scales that charac- 
terize the imaged aggregates (Figure 5B). We found that 
vesicle-embedded viruses yielded a characteristic clustering 
scale of 2.0 ± 0.1 μm (n = 4 images), while free virus yielded 
a five-fold smaller characteristic scale of 0.4 ± 0.1 μm 
(n = 2 images). However, these estimates are too close to 
the spatial resolution of TIRF imaging and thus only qualita- 
tive. To circumvent this diffraction limit, we then applied the 
direct stochastic optical reconstruction microscopy (dSTORM) 
methodology to achieve super-resolution of these viral parti- 
cles (Figure 5C). dSTORM relies on sequential imaging and 
fitting to achieve a spatial resolution of individual dyes within 
30 nm (Bates et al., 2007; Baddeley et al., 2009; Dempsey 
et al., 2011). 
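
As a rough illustration of how a characteristic length can be read off an exponentially decaying autocorrelation, the sketch below fits g(r) ≈ g0·exp(−r/ξ) by least squares on log g(r). The function name, the single-exponential form without offset, and the sample values are assumptions made for illustration; they are not the analysis code or the data used in this study.

```python
import math


def decay_length(radii, g_values):
    """Fit g(r) = g0 * exp(-r / xi) by linear least squares on log(g).

    Returns (xi, g0). Assumes every g value is positive and that a single
    exponential without offset is an adequate model (an illustrative
    assumption).
    """
    if len(radii) != len(g_values) or len(radii) < 2:
        raise ValueError("need at least two (r, g) pairs of equal length")
    logs = [math.log(g) for g in g_values]
    n = len(radii)
    mean_r = sum(radii) / n
    mean_l = sum(logs) / n
    sxx = sum((r - mean_r) ** 2 for r in radii)
    sxy = sum((r - mean_r) * (l - mean_l) for r, l in zip(radii, logs))
    slope = sxy / sxx  # slope of log g versus r equals -1 / xi
    intercept = mean_l - slope * mean_r
    return -1.0 / slope, math.exp(intercept)


# Synthetic, noise-free curve with a 2.0 um decay length (hypothetical values).
radii_um = [0.5 * k for k in range(1, 9)]
g_synth = [3.0 * math.exp(-r / 2.0) for r in radii_um]
xi, g0 = decay_length(radii_um, g_synth)
print(f"characteristic scale: {xi:.2f} um")
```

A log-linear fit is used here only because it needs no external libraries; with noisy data, a nonlinear fit that includes a baseline term would usually be more robust.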
In order to assess the degree of clustering within these viral 
spreads, we used the Ripley’s K statistical test. By definition, 

$$ K(r) = \frac{A}{n^2} \sum_{i \neq j} \delta(d_{ij} - r), \qquad \text{with } \delta(x) = \begin{cases} 1 & \text{for } x \le 0 \\ 0 & \text{for } x > 0 \end{cases} $$

with $d_{ij}$ being the distance between the i-th and j-th points, n the 
number of points, and A the image area. By definition, $K(r)/\pi r^2$ is 
around 1 for a homogeneous distribution of points, and larger than 1 for 
clustered spatial distribution 
(Veatch et al., 2012; Termini et al., 2014). Based on the 
electron microscopy images (Figures 2A, 2B, and S2A), we anticipated 
clusters of viruses within vesicles of 200-400 nm diameter. 
Hence, we calculated K(r = 200 nm) from the dSTORM 
data to test whether vesicles contained clusters of viruses: we 
found that K(r = 200 nm) = 2.4 ± 0.9 for free viruses and K(r = 
200 nm) = 13.7 ± 6.7 for viruses within vesicles (Figure 5D). 
Hence, super-resolution microscopy does confirm that these 
viral vesicles do pack large numbers of viruses. 
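
To make the clustering statistic concrete, the sketch below evaluates K(r)/πr² for a set of 2D localizations, assuming the common estimator K(r) = (A/n²) × (number of ordered pairs i ≠ j with d_ij ≤ r) and no edge correction. The function name, the normalization, and the coordinates are illustrative assumptions; this is not the MATLAB analysis used for Figure 5D.

```python
import math


def ripley_k_normalized(points, r, area):
    """Return K(r) / (pi * r**2) for 2D points, without edge correction.

    K(r) = (A / n**2) * (number of ordered pairs i != j with d_ij <= r).
    Values near 1 indicate a homogeneous pattern; values above 1 indicate
    clustering at length scale r.
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    pairs = 0
    for i in range(n):
        xi, yi = points[i]
        for j in range(i + 1, n):
            xj, yj = points[j]
            if math.hypot(xi - xj, yi - yj) <= r:
                pairs += 2  # count both (i, j) and (j, i)
    k = area * pairs / (n * n)
    return k / (math.pi * r * r)


# Hypothetical localizations (nm) inside a 1000 nm x 1000 nm field of view.
clustered = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0), (10.0, 10.0)]
dispersed = [(0.0, 0.0), (1000.0, 0.0), (0.0, 1000.0), (1000.0, 1000.0)]
k_clustered = ripley_k_normalized(clustered, 200.0, 1.0e6)
k_dispersed = ripley_k_normalized(dispersed, 200.0, 1.0e6)
```

The pairwise loop scales as n², which is acceptable for a sketch but would need a spatial index for the tens of thousands of localizations typical of a dSTORM acquisition.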

Consistent with these findings, transmission electron micro- 
graphs of PS vesicles (after isolation by Annexin V-coupled mi- 
crobeads) revealed multiple clustered viral particles surrounded 
by a single membrane bilayer (Figure 5E). Per 200 nm cross-sec- 
tion, a single vesicle contained on average 19 ± 3 PV particles 
(n = 6 vesicles). 

The presence of a single bilayer, as opposed to multiple bila- 
yers, also indicated that the double-membraned autophago- 
some-like organelles had fused with the plasma membrane, 
rather than budded, in order to release PS vesicles. Thus, the 
PS lipids on the extracellular membrane leaflet of the vesicle 
are in a compartment that is topologically equivalent to the 
lumenal membrane leaflet of the double-membraned autopha- 
gosome-like organelle. The ER is a major membrane source for 
autophagosomes (Hamasaki et al., 2013) and has lumenal mem- 
brane leaflet enriched in PS lipids (Kay et al., 2012). Given that 
the isolated PS vesicles also contain ER-resident proteins (Fig- 
ure S2B), it is highly likely that the autophagosome-like organ- 
elles, and thereby the released PS vesicles, originated from ER 
and/or ER-derived replication organelle membranes. 

Vesicles Allow Multiple Viral RNA Molecules to Be 
Collectively Transferred into Cells 

Given our dSTORM and TEM findings, we conjectured that multiple 
PV particles within a single vesicle might allow multiple viral 
genomes to be simultaneously transferred into a single cell. To 
test this hypothesis, we performed single molecule fluorescence 
in situ hybridization (single molecule RNA FISH) (Raj et al., 2008). 
Multiple oligonucleotide fluorescent probes hybridizing to an 
RNA molecule allow sufficient sensitivity to detect single RNA 
molecules within cells (Shaffer et al., 2013; Lubeck and Cai, 
2012). 

Forty-eight fluorescently labeled nucleic acid probes, each 22 
nucleotides in length, complementary to the PV genome, were 
synthesized. Fixed numbers of HeLa cells were incubated with 
different titers of either free PV particles or vesicle-associated 
PV particles for 1.5 hr and processed for FISH labeling as 
described previously (Shaffer et al., 2013) (Figure 5F). Note that 
we could exclude the possibility of any viral RNA synthesis occurring 
during this incubation time because cells incubated with and without 
guanidine HCl, an inhibitor of viral RNA synthesis, showed similar 
amounts of viral RNA molecules per cell (Figures 5G and S3). 

For cells incubated with free PV particles, discrete fluorescent 
puncta, spatially segregated from one another in the cytoplasm 
were detected in individual cells (Figure 5F, free virus). In 
contrast, for cells incubated with equivalent titers of PV particles 
in vesicles, there were many more fluorescent puncta, spatially 
juxtaposed (Figure 5F, virus in vesicle). The lowest multiplicity 
of infection with free viral particles was observed at 1.5 × 10^7 
pfu/ml, where there was on average only one fluorescent punctum 
per cell (Figures 5F and 5H, free virus). Hence, the size of these 
puncta was approximated as a single viral RNA molecule and 
used in subsequent quantification and analysis of FISH data. 
At each titer, for either free virus or vesicle-associated virus, 55 
cells were randomly chosen and viral RNA molecules counted 
in each cell. The number of viral RNA molecules per cell was 
subsequently plotted in Figure 5H. 

From our quantification, we found that at 12 × 10^7 pfu/ml and 
6 × 10^7 pfu/ml, there were ~40% and ~75%, respectively, more 
viral RNA molecules per cell when PV particles were presented in 
vesicles than as free virus (Figure 5H). Indeed, for cells incubated 
with free PV particles, we found that as the viral titer decreased 
4-fold (from 12 × 10^7 pfu/ml to 3 × 10^7 pfu/ml), there was a 
~90% decrease in the number of PV RNA molecules per cell, 
whereas for cells incubated with PV particles in vesicles, this 
decrease was significantly less, only ~40% (Figure 5H). Furthermore, 
at any given titer there were significantly more cells 
with >15 PV RNA molecules within them when they had been infected 
with PV in vesicles as opposed to free PV (Figure 5I). 
These data are consistent with our dSTORM findings (Figures 
5A-5D) and transmission electron micrographs of isolated PS 
vesicles (Figure 5E) and indicate that vesicles contain multiple 
PV particles, which enable multiple viral RNA genomes to be 
transferred en bloc into a cell. Note that there was a large varia- 
tion in the number of viral RNA genomes per cell when cells were 
infected with PV particles in vesicles as opposed to free PV particles 
(Figures 5H and 5I). Since de novo viral RNA synthesis can 
be ruled out, this variation is likely due to differences in the num- 
ber of viral particles packaged per vesicle as well as contamina- 
tion from free particles due to possible vesicle lysis. 
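
The per-cell comparisons above reduce to two summary quantities: the percent change in mean RNA count between titers and the fraction of cells at or above a puncta threshold. A minimal sketch follows; the function names and the counts are hypothetical, and the threshold is taken as 15 or greater, following the Figure 5I legend.

```python
def mean(counts):
    """Arithmetic mean of per-cell RNA counts."""
    if not counts:
        raise ValueError("counts must not be empty")
    return sum(counts) / len(counts)


def percent_change(reference, value):
    """Percent change of value relative to reference (negative = decrease)."""
    if reference == 0:
        raise ValueError("reference must be non-zero")
    return 100.0 * (value - reference) / reference


def fraction_at_least(counts, threshold=15):
    """Fraction of cells whose RNA puncta count is >= threshold."""
    if not counts:
        raise ValueError("counts must not be empty")
    return sum(1 for c in counts if c >= threshold) / len(counts)


# Hypothetical per-cell puncta counts at a high and a low titer.
high_titer = [22, 18, 20, 20]
low_titer = [2, 1, 3, 2]
drop = percent_change(mean(high_titer), mean(low_titer))
print(f"change in mean RNA count: {drop:.0f}%")
```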

DISCUSSION 

Here, we have shown that multiple mature infectious enteroviral 
particles are released in single unilamellar PS lipid vesicles, 
which in turn enables multiple viral genomes to be collectively 
transferred to an individual cell in a new round of infection. These 
PS vesicles containing multiple viral genomes appear to be 
significantly more efficient in infection and may facilitate genetic 
cooperation among viral genomes. 

We first detected assembled PV capsids at replication organelles 
where viral RNA was synthesized (Figures 1A and 6). This 
close juxtaposition of capsids with viral RNA synthesis sites 
would serve to immediately encapsidate the viral RNA and 
thereby facilitate efficient genome packaging as well as protection 
of viral genomes from host defenses. These viral particles 
then translocated into the cytoplasm, which was not only temporally 
coincident with a cessation of viral RNA synthesis but also 
could be prematurely triggered by inhibiting RNA synthesis with 
GuHCl (Figure S1A). Enteroviral 2C proteins may modulate the 
close coupling between RNA synthesis kinetics and capsid 
release from replication organelles as 2C proteins are not only 
localized to the replication organelles and required for viral 
RNA synthesis but also physically interact with capsids (Liu 
et al., 2010). 

Once the capsids dissociated from the replication organelles, 
they were sequestered within double-membraned LC3-II-positive 
autophagosome-like organelles. How the viral cargo is 
recognized and captured within these organelles, the capsid-associated 
determinants, and whether LC3-II proteins play a 
role in these processes as they do in canonical autophagy pathways 
(Rogov et al., 2014) are currently unknown. Interestingly, unlike 
canonical autophagosomes that fuse with lysosomes and 
degrade their cytoplasmic cargo, these capsid-containing organelles 
fused with the plasma membrane to non-lytically release >80% of 
the PV particles into the extracellular environment 
within unilamellar vesicles of size range 200-400 nm. 
Notably, the SNARE protein syntaxin 17 was not localized to 
the autophagosome-like organelles to regulate their fusion with 
lysosomes, but instead was found sequestered away at the 
replication organelles (Figure S1I). Whether specific enteroviral 
proteins actively modulate its subcellular localization or it is an indirect 
consequence of the affinity of the syntaxin 17 hydrophobic 
hairpin tail for the PI4P/cholesterol-rich replication organelle 
membranes remains to be investigated. This type of non-conventional 
secretion of autophagosomal membranes from the 
cell has never been reported and identifying the machinery regulating 
this process, including determining which cytoskeletal 
components and SNARE proteins are utilized for movement 
out to the periphery and fusion with the plasma membrane, 
may provide novel therapeutic targets to block enterovirus 
release from cells. 
Figure 6. Model 

Assembled mature enteroviruses are released from 
the replication organelles into the cytoplasm. 
Clusters of multiple viral particles are selectively 
captured by double-membraned organelles that 
originate from the ER and ER-derived replication 
organelles. These double-membraned organelles, 
which resemble autophagosomes, contain PS 
lipids on both the lumenal and cytoplasmic leaflets 
of their membranes. They fuse with the plasma 
membrane and release a unilamellar PS-lipid-en- 
riched vesicle, containing multiple viral particles, 
into the extracellular medium. This vesicle then 
facilitates infection in a PS-lipid and viral receptor- 
dependent mechanism resulting in the collective 
transfer to a new recipient host cell of multiple viral 
RNA genomes. This mode of viral transmission 
enhances infection efficiency and potentially allows 
for genetic complementation among quasispecies. 



Figure 5. PS Vesicles Contain Clustered PV Particles, which Enable Multiple Viral RNA Genomes to Be Collectively Transferred into a New 
Host Cell 

(A) TIRF and DIC images of free and vesicle-associated PV particles labeled with Atto488-labeled A12 antibody. 

(B) Difference in distribution of fluorescence in (A) by computation of the radial autocorrelation function g(r). 

(C) dSTORM imaging of free and vesicle-associated viral particles labeled with Atto488-A12 antibody. 

(D) Calculation of Ripley’s K function to assess the degree of clustering of vesicle-associated PV particles relative to free particles. Data represented as 
mean ± SEM. 

(E) PS vesicles isolated from PV-infected cells using Annexin V microbeads were imaged by transmission electron microscopy. Note the numerous electron- 
dense viral particles in each vesicle and the unilamellar surrounding membrane. Asterisk shows Annexin V microbead attached to the exterior of one vesicle. 
Scale bars represent 100 nm. 

(F) Collected intact vesicles or free viral particles from PV-infected cells were incubated with a confluent layer of new host cells at different viral titers. Viral RNAs 
were monitored by single molecule RNA FISH and imaged by dual confocal/DIC microscopy at 1.5 hr p.i. Shown are images of single HeLa cells infected with 
either free viral particles or vesicle-associated viral particles. Images presented were acquired with the same microscopy settings. Scale bar represents 2 μm. 

(G) Collected vesicles were incubated with cells with/without GuHCl for 1.5 hr and viral RNA molecules were monitored by single molecule RNA FISH (see also 
Figure S3 for quantification). Scale bar represents 2 μm. 

(H) Quantification of the number of viral RNA molecules per cell in cells infected with either free (n = 55 cells) or vesicle-associated PV particles (n = 55 cells). ***p < 2 × 10^-3; 10^-4. 

(I) Percent of cells with 15 or greater PV RNA puncta was quantified for cells infected with either free or vesicle-associated PV particles for each viral titer shown. 
Data represented as mean ± SEM. 
PS vesicles non-lytically released from cells were selectively 
enriched in mature enteroviral particles (Figure 3). While we 
cannot exclude the possibility that non-PS vesicles may also 
contribute to the non-lytic release of enteroviral particles, the 
observed significant inhibition of subsequent infection when 
PS is blocked (Figures 4C and 4D) suggests that PS vesicles 
constitute a large fraction of the non-lytic conduit for enteroviral 
release from cells. Live-cell time-lapse imaging in the presence 
of fluorescently labeled Annexin V protein revealed that the outer 
membrane leaflet of the released vesicle, which is topologically 
equivalent to the lumenal membrane leaflet of the double-membraned 
organelle before fusion, contained PS lipids (Figures 
3A-3D; Movie S1). The ER and/or the ER-derived replication organelles 
both have PS lipids on their lumenal and cytoplasmic 
leaflets (Leventis and Grinstein, 2010; Lev, 2012) (Figure 1D), 
and moreover, the PS can flip between these two leaflets (Clark, 
2011). Thus, it is likely that the host source for the autophagosome-like 
organelles is the ER and/or ER-derived replication organelles. 
Supporting this conclusion, the released PS vesicles 
indeed contained ER markers, including the integral ER membrane 
protein calnexin (Figure S2B). 

Super-resolution imaging and transmission electron microscopy 
revealed that extracellular PS vesicles contain multiple 
viral particles, at least 20 per 200 nm cross-section (Figures 
5A-5E). Supporting these data, in single molecule RNA FISH ex- 
periments, we found that infection by PV particles in vesicles al- 
lowed the collective transfer of multiple viral genomes into a sin- 
gle host cell (Figures 5F-5I). One important implication of these 
findings is that it ties the fate of individual viral genomes from 
previous rounds of replication to each other and thereby may 
provide selective advantages in terms of replication kinetics 
and genetic diversity relative to free viral particle genomes 
(i.e., not in vesicle). Indeed, infection efficiency was significantly 
higher when cells were infected with PV particles in vesicles as 
opposed to an equivalent number of free virus particles (Figures 
4E and 4F). Enteroviruses, as well as all other positive-stranded 
RNA viruses, are enormously diverse in genomic variety due to 
the inherent error rates and lack of proof reading in their RNA 
polymerases that can generate large numbers of viral quasispe- 
cies after even a single round of infection (Borderia et al., 2011). 
Vesicular transfer of multiple particles among cells would in- 
crease the chances of genetic complementation among viral 
quasispecies, potentially benefiting the replication efficiency of 
otherwise attenuated or weak genomes and enabling them to 
maintain a presence in the genetic pool. Indeed, our findings 
may provide a cellular mechanism to explain the results of Vi- 
gnuzzi et al. (2006) where cooperative interactions between 
neurotropic and non-neurotropic PV quasispecies were re- 
ported. Second, when multiple viral genomes are transferred 
to a single cell, the likelihood of one or more genomes surviving 
the hostile host environment to override host defenses and repli- 
cate may be higher. Third, for positive-stranded RNA viruses, 
rather than a single genome having to switch between RNA 
translation and RNA synthesis activities (until sufficient levels 
of viral RNA have been synthesized to partition those functions 
among genomes), multiple genomes could right from the start 
of infection partition RNA translation and RNA synthesis func- 
tions among themselves and enhance overall replication kinetics 



and viral protein levels. Finally, the PS lipids on the vesicles 
themselves could enhance infection efficiency by attracting PS 
scavenging cells, such as macrophages and dendritic cells, to 
take up the viral genomes and provide a host environment for 
replication. In particular, we found that PV particles within vesi- 
cles could replicate significantly more efficiently within primary 
macrophages than free PV particles (Figure 4F). 

What is the mechanism whereby viral particles within vesicles 
infect host cells? We found infection to be dependent not only on 
the virus-specific receptor expressed by the host but also on the 
PS lipids associated with the vesicle having access to the host 
cell (Figures 4C and 4D). One potential mechanism to explain 
these findings is that PS lipids on vesicles engage PS receptors 
on the recipient host cell prior to the viral particles engaging their 
own receptors. The binding to PS receptors can trigger phagocytic 
uptake of vesicles (Hoffmann et al., 2001) followed by lysis 
or permeabilization of the vesicle within the endosomal compartment 
to then enable viral particles to engage their specific receptors 
(Figure 6). Equally possible, binding to PS receptors may 
lead to permeabilization of the vesicle on the cell surface, result- 
ing in the release of a concentrated bolus of viral particles in the 
immediate vicinity of the cell. The latter mechanism may also 
provide access to neutralizing antibodies which bind capsids 
and block infection. Reliance on PS lipids and PS receptors for 
infection has been documented for a number of other viruses 
including vaccinia, Dengue, Ebola, hepatitis A virus, and HIV 
(Sui et al., 2006; Mercer and Helenius, 2009; Feng et al., 2014; 
Morizano and Chen, 2014). The specific use of PS lipids by en- 
teroviruses as well as other viruses to traffic between cells may 
have significant in vivo implications for viral pathogenesis and 
tissue tropism. In particular, the infection of primary macro- 
phages, the major PS sensing cells in the body, may provide 
enteroviruses with the ability to target a key cell subset of the im- 
mune system while suppressing its inflammatory responses, as 
PS lipids have been well documented to inhibit inflammatory 
cytokine production by macrophages (Hochreiter-Hufford and 
Ravichandran, 2013). PS-lipid vesicles may also help enteroviruses 
exploit the natural motility of macrophages and help 
spread them to distant sites, including perhaps the CNS (Ousman 
and Kubes, 2012). 

In summary, we report here a novel mode of viral transmission 
among cells where multiple viral particles are clustered and 
collectively released within PS-lipid-enriched vesicles. This pro- 
vides greater infection efficiency and potentially an opportunity 
for cooperation and complementation among viral quasispecies. 
This mode of transmission links the fate of multiple viral particles 
to one another and may have implications for maintaining viral 
genetic diversity within viral evolution. 

EXPERIMENTAL PROCEDURES 

All detailed protocols and information regarding plasmids, antibodies, cell cul- 
ture, virus infection, and propagation are provided in the Extended Experi- 
mental Procedures. 

Immunofluorescence 

Cells were plated on glass coverslips and fixed with 4% PFA for 15 min at RT. 
Cells were permeabilized with either 0.2% saponin or 0.1% Triton X-100 and 
sequentially incubated with primary and fluorophore-tagged secondary 
antibodies. Coverslips were mounted in Fluoromount-G (Southern Biotech) 
and imaged. 

Confocal Microscopy 

All confocal images were obtained with an LSM780 laser scanning confocal 
microscope system (Carl Zeiss) and images were analyzed with either Zen 
(Carl Zeiss) or ImageJ (NIH) software. 

dSTORM 

Free PV particles and PV particles in vesicles were plated on gridded glass bot- 
tom dishes and fixed with 4% PFA for 15 min at room temperature. Subse- 
quently they were permeabilized with 0.2% saponin and incubated with 
Atto488-conjugated A12 antibodies. An oxygen-scavenging PBS solution
(10 mM NaCl, 0.5 mg/ml glucose oxidase, 40 μg/ml catalase, 2% glucose,
and 10 mM MEA) was used for imaging. dSTORM images were obtained on
a Zeiss ELYRA PS.1 system (Carl Zeiss). Images were acquired with a
Plan-Apochromat 100x/1.46 oil immersion objective and an Andor iXon 885
EMCCD camera. A total of 20,000 images were acquired per sample with an 
exposure time of 33 ms. Raw images were reconstructed and analyzed with 
ZEN software (Carl Zeiss) and MATLAB (MathWorks) using methodology 
from Veatch et al. (2012) and Termini et al. (2014). 
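
As a rough orientation, the frame count and exposure time stated above set a lower bound on the acquisition time per sample. The short Python sketch below works through that arithmetic; the function name is hypothetical, and camera readout and transfer overhead are assumed to be zero, so the real acquisition time would be somewhat longer.

```python
# Acquisition parameters quoted in the dSTORM paragraph above.
N_FRAMES = 20_000    # raw images acquired per sample
EXPOSURE_S = 0.033   # 33 ms exposure per frame, in seconds


def min_acquisition_time_s(n_frames: int, exposure_s: float) -> float:
    """Lower bound on acquisition time; ignores readout and transfer overhead."""
    return n_frames * exposure_s


total_s = min_acquisition_time_s(N_FRAMES, EXPOSURE_S)
print(f"{total_s:.0f} s (about {total_s / 60:.0f} min) per sample")
```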

Structured Illumination Microscopy 

Super-resolution 3D-structured illumination microscopy (SIM) imaging was 
performed on a Zeiss ELYRA PS.1 system (Carl Zeiss). Images were acquired 
with a Plan-Apochromat 63x/1.40 oil immersion objective and an Andor iXon
885 EMCCD camera. Fifteen images per plane (five phases, three rotations)
and 0.125 μm z sections spanning 3 μm in height were required for generating
super-resolution images. Raw images were reconstructed and processed to
demonstrate structure with greater resolution by the ZEN 2011 software (Carl Zeiss).
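
The number of raw frames behind one reconstructed SIM stack follows from the figures above. The Python sketch below is only an illustration: it assumes the stack height divided by the z-section thickness gives the number of sections (an extra closing plane would add one more), and the variable names are hypothetical.

```python
# Values quoted in the SIM paragraph above; lengths share the text's units.
PHASES = 5
ROTATIONS = 3
Z_STEP = 0.125       # z-section thickness
STACK_HEIGHT = 3.0   # total height of the stack

images_per_plane = PHASES * ROTATIONS            # matches "fifteen images per plane"
n_sections = round(STACK_HEIGHT / Z_STEP)        # assumption: no extra end plane
raw_images_per_stack = images_per_plane * n_sections

print(images_per_plane, n_sections, raw_images_per_stack)
```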

Single Molecule RNA FISH 

We performed single molecule RNA FISH according to Shaffer et al. (2013). 
Cells were incubated with either free or vesicle-associated PV particles for 
1.5 hr. Cells were subsequently fixed in pre-chilled methanol (-20°C) for 
10 min. The methanol was removed and the cells were hybridized with 10 μl
of hybridization buffer containing 4 pM PV probe, 10% formamide, 2x SSC,
and 10% dextran sulfate for 10 min at 37°C. Next, the samples were washed
three times with pre-warmed wash buffer (10% formamide and 2x SSC) at
37°C for 1 min and imaged with a Zeiss LSM780 confocal microscope.

Annexin V Labeling 

Cells for live imaging were grown on coverslip-bottomed Lab-Tek chambers 
(Thermo Fisher) and infected with PV for 7 hr. Cells were then placed in
imaging medium (phenol red-free DMEM supplemented with 10% FBS and 50 mM
HEPES [pH 7.3]). Alexa 568-Annexin V was added to the cells and imaging
was performed on a Zeiss LSM780 confocal laser scanning microscope
(Carl Zeiss) equipped with 458 nm, 488 nm, 514 nm, 565 nm, and 633 nm laser
lines and a detection system for fluorescence and DIC imaging. The microscope
was additionally equipped with a heating stage and incubator with temperature,
humidity, and CO2 control for live-cell imaging.

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, three 
figures, and one movie and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2015.01.032.

ACKNOWLEDGMENTS 

The authors would like to thank Yasmine Belkaid, Konstantin Chumakov, Ana 
Maria Cuervo, James Hogle, Gerald Feldman, Jennifer Jones, Jennifer Lippin- 
cott-Schwartz, Sanford Simon, Radek Dobrowolski, Wilma Friedman, Ellie 
Ehrenfeld and members of the Altan-Bonnet lab for fruitful discussions. The 
authors would like to especially thank Wen-Chin Tseng for help in constructing 
the graphical cartoons. The authors also thank Frank Macaluso and Geoffrey
Perumal of the Albert Einstein College of Medicine Analytical Imaging facility
for technical support. A.C., P.T., and C.P. were supported by grant
AI091985-01A1; G.A.B. was supported by grant AI083408; H.M., P.J., and
E.W. were supported by grant AI15122; and Y.C., W.D., M.H., C.B., P.C., 
E.S., and N.A.B. were supported by intramural NIH funds. 

Received: October 2, 2014 
Revised: December 13, 2014 
Accepted: January 12, 2015 
Published: February 12, 2015 

REFERENCES 

Baddeley, D., Jayasinghe, I.D., Cremer, C., Cannell, M.B., and Soeller, C. 
(2009). Light-induced dark states of organic fluochromes enable 30 nm reso- 
lution imaging in standard media. Biophys. J. 96, L22-L24. 

Barton, D.J., and Flanegan, J.B. (1997). Synchronous replication of poliovirus 
RNA: initiation of negative-strand RNA synthesis requires the guanidine-inhibited
activity of protein 2C. J. Virol. 71, 8482-8489.

Bates, M., Huang, B., Dempsey, G.T., and Zhuang, X. (2007). Multicolor super- 
resolution imaging with photo-switchable fluorescent probes. Science 317, 
1749-1753. 

Bird, S.W., Maynard, N.D., Covert, M.W., and Kirkegaard, K. (2014). Nonlytic 
viral spread enhanced by autophagy components. Proc. Natl. Acad. Sci. 
USA 111, 13081-13086. 

Bordería, A.V., Stapleford, K.A., and Vignuzzi, M. (2011). RNA virus population
diversity: implications for inter-species transmission. Curr. Opin. Virol. 1, 643-648.

Brandenburg, B., and Zhuang, X. (2007). Virus trafficking - learning from
single-virus tracking. Nat. Rev. Microbiol. 5, 197-208.

Brandenburg, B., Lee, L.Y., Lakadamyali, M., Rust, M.J., Zhuang, X., and
Hogle, J.M. (2007). Imaging poliovirus entry in live cells. PLoS Biol. 5, e183.

Chen, Z., Chumakov, K., Dragunsky, E., Kouiavskaia, D., Makiya, M., Neverov,
A., Rezapkin, G., Sebrell, A., and Purcell, R. (2011). Chimpanzee-human
monoclonal antibodies for treatment of chronic poliovirus excretors and
emergency postexposure prophylaxis. J. Virol. 85, 4354-4362.

Chen, Z., Fischer, E.R., Kouiavskaia, D., Hansen, B.T., Ludtke, S.J., Bidzhieva,
B., Makiya, M., Agulto, L., Purcell, R.H., and Chumakov, K. (2013).
Cross-neutralizing human anti-poliovirus antibodies bind the recognition site for
cellular receptor. Proc. Natl. Acad. Sci. USA 110, 20242-20247.

Clark, M.R. (2011). Flippin’ lipids. Nat. Immunol. 12, 373-375.

Dempsey, G.T., Vaughan, J.C., Chen, K.H., Bates, M., and Zhuang, X. (2011).
Evaluation of fluorophores for optimal performance in localization-based
super-resolution imaging. Nat. Methods 8, 1027-1036.

den Boon, J.A., and Ahlquist, P. (2010). Organelle-like membrane
compartmentalization of positive-strand RNA virus replication factories. Annu. Rev.
Microbiol. 64, 241-256.

Doceul, V., Hollinshead, M., van der Linden, L., and Smith, G.L. (2010). Repul- 
sion of superinfecting virions: a mechanism for rapid virus spread. Science 
327, 873-876. 

Ehrenfeld, E., Maizel, J.V., and Summers, D.F. (1970). Soluble RNA polymerase
complex from poliovirus-infected HeLa cells. Virology 40, 840-846.

Fadok, V.A., Voelker, D.R., Campbell, P.A., Cohen, J.J., Bratton, D.L., and
Henson, P.M. (1992). Exposure of phosphatidylserine on the surface of
apoptotic lymphocytes triggers specific recognition and removal by
macrophages. J. Immunol. 148, 2207-2216.

Feng, Z., Hensley, L., McKnight, K.L., Hu, F., Madden, V., Ping, L., Jeong, S.H.,
Walker, C., Lanford, R.E., and Lemon, S.M. (2013). A pathogenic picornavirus
acquires an envelope by hijacking cellular membranes. Nature 496, 367-371.

Feng, Z., Li, Y., McKnight, K.L., Hensley, L., Lanford, R.E., Walker, C.M., and
Lemon, S.M. (2014). Human pDCs preferentially sense enveloped hepatitis A
virions. J. Clin. Invest. 125, 169-176.

Flint, S.J., Enquist, L.W., Racaniello, V.R., and Skalka, A.M. (2009). Principles 
of Virology, Third Edition (ASM Press). 






Hamasaki, M., Furuta, N., Matsuda, A., Nezu, A., Yamamoto, A., Fujita, N.,
Oomori, H., Noda, T., Haraguchi, T., Hiraoka, Y., et al. (2013). Autophagosomes
form at ER-mitochondria contact sites. Nature 495, 389-393.

Hochreiter-Hufford, A., and Ravichandran, K.S. (2013). Clearing the dead:
apoptotic cell sensing, recognition, engulfment, and digestion. Cold Spring
Harb. Perspect. Biol. 5, a008748.

Hoffmann, P.R., deCathelineau, A.M., Ogden, C.A., Leverrier, Y., Bratton, D.L., 
Daleke, D.L., Ridley, A.J., Fadok, V.A., and Henson, P.M. (2001). Phosphatidyl- 
serine (PS) induces PS receptor-mediated macropinocytosis and promotes 
clearance of apoptotic cells. J. Cell Biol. 155, 649-659. 

Hsu, N.Y., Ilnytska, O., Belov, G., Santiana, M., Chen, Y.H., Takvorian, P.M.,
Pau, C., van der Schaar, H., Kaushik-Basu, N., Balla, T., et al. (2010). Viral
reorganization of the secretory pathway generates distinct organelles for RNA
replication. Cell 141, 799-811.

Ilnytska, O., Santiana, M., Hsu, N.Y., Du, W.L., Chen, Y.H., Viktorova, E.G.,
Belov, G., Brinker, A., Storch, J., Moore, C., et al. (2013). Enteroviruses
harness the cellular endocytic machinery to remodel the host cell cholesterol
landscape for effective viral replication. Cell Host Microbe 14, 281-293.

Itakura, E., Kishi-Itakura, C., and Mizushima, N. (2012). The hairpin-type
tail-anchored SNARE syntaxin 17 targets to autophagosomes for fusion with
endosomes/lysosomes. Cell 151, 1256-1269.

Jackson, W.T., Giddings, T.H., Jr., Taylor, M.P., Mulinyawe, S., Rabinovitch,
M., Kopito, R.R., and Kirkegaard, K. (2005). Subversion of cellular
autophagosomal machinery by RNA viruses. PLoS Biol. 3, e156.

Kay, J.G., Koivusalo, M., Ma, X., Wohland, T., and Grinstein, S. (2012).
Phosphatidylserine dynamics in cellular membranes. Mol. Biol. Cell 23, 2198-2212.

Kirkegaard, K., and Jackson, W.T. (2005). Topology of double-membraned
vesicles and the opportunity for non-lytic release of cytoplasm. Autophagy
1, 182-184.

Koopman, G., Reutelingsperger, C.P., Kuijten, G.A., Keehnen, R.M., Pals, S.T., 
and van Oers, M.H. (1994). Annexin V for flow cytometric detection of 
phosphatidylserine expression on B cells undergoing apoptosis. Blood 84, 
1415-1420. 

Lev, S. (2012). Non-vesicular lipid transfer from the ER. Cold Spring Harb. 
Perspect. Biol. 4, 1-17. 

Leventis, P.A., and Grinstein, S. (2010). The distribution and function of phos- 
phatidylserine in cellular membranes. Annu. Rev. Biophys. 39, 407-427. 

Liu, Y., Wang, C., Mueller, S., Paul, A.V., Wimmer, E., and Jiang, P. (2010). 
Direct interaction between two viral proteins, the nonstructural protein 2C 
and the capsid protein VP3, is required for enterovirus morphogenesis. 
PLoS Pathog. 6, e1001066.

Lubeck, E., and Cai, L. (2012). Single-cell systems biology by super-resolution 
imaging and combinatorial labeling. Nat. Methods 9, 743-748. 

Mercer, J., and Helenius, A. (2009). Virus entry by macropinocytosis. Nat. Cell 
Biol. 11, 510-520. 

Morizono, K., and Chen, I.S. (2014). Role of phosphatidylserine receptors in
enveloped virus infection. J. Virol. 88, 4275-4290.

Nchoutmboube, J.A., Viktorova, E.G., Scott, A.J., Ford, L.A., Pei, Z., Watkins,
P.A., Ernst, R.K., and Belov, G.A. (2013). Increased long chain acyl-CoA
synthetase activity and fatty acid import is linked to membrane synthesis
for development of picornavirus replication organelles. PLoS Pathog. 9,
e1003401.

Ousman, S.S., and Kubes, P. (2012). Immune surveillance in the central 
nervous system. Nat. Neurosci. 15, 1096-1101. 

Owens, R.J., Limn, C., and Roy, P. (2004). Role of an arbovirus nonstructural 
protein in cellular pathogenesis and virus release. J. Virol. 78, 6649-6656. 

Raj, A., van den Bogaard, P., Rifkin, S.A., van Oudenaarden, A., and Tyagi, S. 
(2008). Imaging individual mRNA molecules using multiple singly labeled 
probes. Nat. Methods 5, 877-879. 

Robinson, S.M., Tsueng, G., Sin, J., Mangale, V., Rahawi, S., McIntyre, L.L., 
Williams, W., Kha, N., Cruz, C., Hancock, B.M., et al. (2014). Coxsackievirus 
B exits the host cell in shed microvesicles displaying autophagosomal 
markers. PLoS Pathog. 10, e1004045.

Rogov, V., Dötsch, V., Johansen, T., and Kirkin, V. (2014). Interactions between
autophagy receptors and ubiquitin-like proteins form the molecular basis for 
selective autophagy. Mol. Cell 53, 167-178. 

Shaffer, S.M., Wu, M.T., Levesque, M.J., and Raj, A. (2013). Turbo FISH: a 
method for rapid single molecule RNA FISH. PLoS ONE 8, e75120. 

Strauss, M., Levy, H.C., Bostina, M., Filman, D.J., and Hogle, J.M. (2013). RNA 
transfer from poliovirus 135S particles across membranes is mediated by long 
umbilical connectors. J. Virol. 87, 3903-3914. 

Strazynski, M., Kramer, J., and Becker, B. (2002). Thermal inactivation of polio- 
virus type 1 in water, milk and yoghurt. Int. J. Food Microbiol. 74, 73-78. 

Sui, L., Zhang, W., Chen, Y., Zheng, Y., Wan, T., Zhang, W., Yang, Y., Fang, G.,
Mao, J., and Cao, X. (2006). Human membrane protein Tim-3 facilitates hep- 
atitis A virus entry into target cells. Int. J. Mol. Med. 17, 1093-1099. 

Swairjo, M.A., Concha, N.O., Kaetzel, M.A., Dedman, J.R., and Seaton, B.A.
(1995). Ca(2+)-bridging mechanism and phospholipid head group recognition
in the membrane-binding protein annexin V. Nat. Struct. Biol. 2, 968-974.

Takahashi, M., Yamada, K., Hoshino, Y., Takahashi, H., Ichiyama, K., Tanaka,
T., and Okamoto, H. (2008). Monoclonal antibodies raised against the ORF3
protein of hepatitis E virus (HEV) can capture HEV particles in culture
supernatant and serum but not those in feces. Arch. Virol. 153, 1703-1713.

Taylor, M.P., Burgon, T.B., Kirkegaard, K., and Jackson, W.T. (2009). Role of
microtubules in extracellular release of poliovirus. J. Virol. 83, 6599-6609.

Termini, C.M., Cotter, M.L., Marjon, K.D., Buranda, T., Lidke, K.A., and Gillette,
J.M. (2014). The membrane scaffold CD82 regulates cell adhesion by altering
α4 integrin stability and molecular density. Mol. Biol. Cell 25, 1560-1573.

Veatch, S.L., Machta, B.B., Shelby, S.A., Chiang, E.N., Holowka, D.A., and
Baird, B.A. (2012). Correlation functions quantify super-resolution images
and estimate apparent clustering due to over-counting. PLoS ONE 7, e31457.

Vignuzzi, M., Stone, J.K., Arnold, J.J., Cameron, C.E., and Andino, R. (2006).
Quasispecies diversity determines pathogenesis through cooperative
interactions in a viral population. Nature 439, 344-348.




A Serpin Shapes the Extracellular Environment to 
Prevent Influenza A Virus Maturation 



Graphical Abstract 




Authors 

Meike Dittmann, 

Hans-Heinrich Hoffmann 

Paul D. Bieniasz, Charles M. Rice 

Correspondence 

ricec@rockefeller.edu 

In Brief 

Plasminogen activator inhibitor (PAI-1)
blocks surface glycoprotein maturation of
influenza A virus, thus reducing virus
spread in the airways and revealing that
the innate immune system, driven by type
I interferon, uses modulation of the extracellular
environment to inhibit viruses.

SUMMARY

Interferon-stimulated genes (ISGs) act in concert to 
provide a tight barrier against viruses. Recent studies 
have shed light on the contribution of individual ISG 
effectors to the antiviral state, but most have exam- 
ined those acting on early, intracellular stages of 
the viral life cycle. Here, we applied an image-based 
screen to identify ISGs inhibiting late stages of influenza
A virus (IAV) infection. We uncovered a direct
antiviral function for the gene SERPINE1, encoding
plasminogen activator inhibitor 1 (PAI-1). By targeting
extracellular airway proteases, PAI-1 inhibits
IAV glycoprotein cleavage, thereby reducing infec- 
tivity of progeny viruses. This was biologically rele- 
vant for IAV restriction in vivo. Further, partial PAI-1 
deficiency, attributable to a polymorphism in human 
SERPINE1, conferred increased susceptibility to IAV 
in vitro. Together, our findings reveal that mani- 
pulating the extracellular environment to inhibit the 
last step in a virus life cycle is an important mecha- 
nism of the antiviral response. 

INTRODUCTION 

Because viruses are obligate intracellular parasites, their fate is intricately
linked to the metabolism of their host cells. Viruses exploit host
cell mechanisms throughout their life cycle, from entry to replica- 
tion, assembly and egress, and finally maturation. Each step in 
this cycle is a potential point for antiviral intervention. 

Virus-infected cells produce interferons (IFNs) that trigger 
transcriptional upregulation of IFN-stimulated genes (ISGs) 
(reviewed in Schneider et al., 2014). The combined action of hundreds
of expressed ISG proteins creates multiple lines of defense
against viral infection. It is not surprising that different
ISGs block different pathways and have varying potency against 
different classes of viruses (Schoggins et al., 2014). Our knowl- 
edge of ISG effector functions has been somewhat biased to- 
ward inhibition of early replication stages, such as entry, or viral 
translation initiation. However, viral assembly and release have 
proven to be an Achilles heel for viruses and are the targets of 
therapeutic small molecules (influenza A virus [IAV] neuramini- 
dase inhibitors; HIV-1 protease inhibitors) and intrinsic ISGs 
such as tetherin (Neil et al., 2008) or viperin (Wang et al., 2007). 

In this study, we established a screen that would also uncover 
host factors inhibiting late stages of the viral replication cycle, us- 
ing IAV as a model virus. Applying an approach that accurately 
monitors several rounds of viral replication in the presence of sin- 
gle overexpressed ISGs, we discovered a direct antiviral function 
of the well-known gene SERPINE1, which encodes PAI-1 (Ny 
et al., 1986). We demonstrate that PAI-1 inhibits IAV maturation 
by targeting host proteases needed for viral glycoprotein cleavage 
and that this is physiologically relevant in natural proteolytic land- 
scapes both in vitro and in vivo. Finally, we link a human genetic 
polymorphism in extracellular PAI-1 production to enhanced IAV 
susceptibility in vitro. In all, we show that shaping the extracellular 
environment can be a powerful mechanism in the vast arsenal of 
the innate immune response against invading pathogens. 

RESULTS 

A Gain-of-Function Screen Reveals SERPINE1 Encoding
PAI-1 as an Inhibitor of IAV Spread 

To identify ISG effectors targeting late stages of the IAV life cycle, 
we established a high-throughput, image-based screen using an 
extended version of a recently published ISG library comprising 
401 cDNAs (Schoggins et al., 2011; Table S1). Human lung
adenocarcinoma cells (A549) were transduced with lentiviral
vectors to express individual ISGs (Figure 1A). After 48 hr, cells
were challenged with IAV WSN/33 (H1N1) at an MOI of 0.01.

Figure 1. High-Throughput Microscopy Screens for Inhibitors of IAV Spread

(A) Screening workflow. Shown are hypothetical effects of expressing inhibitory (antiviral) or non-inhibitory ISGs on single or multiple rounds of virus replication. 
Red, transduced cells; green, infected cells; blue, DAPI-stained nuclei. 

(B) Effect of 401 single ISGs on IAV spread. ISGs inhibiting more than 2-fold SD in two independent screens are shown in red. Spread ratio, the ratio of infected
cells at 24 hpi to infected cells at 8 hpi. A positive control for inhibition is α-HA antibody.

(C) Confirmation assays for selected ISGs on A549 cells or primary NHBE cells. Data are represented as mean ± SEM from n = 6 values in two independent 
experiments for A549, and n = 3 for NHBE cells. 

(D) SERPINE1 (PAI-1)-mediated inhibition of A/Vietnam/1203/2004(HALo), A/California/04/2009, and A/sw/Texas/4199-2/1998 in A549 cells. Empty vector,
negative control; IFITM3 and BST2 (tetherin), positive controls. Data are represented as mean ± SEM from n = 4 independent experiments. One-way ANOVA and
Dunn’s multiple comparison test versus “empty.”

(E) ISG-expressing A549 cells were infected with IAV WSN/33 at MOI 0.01, and virus titers were measured by plaque assay on MDCK cells. Data are represented
as mean ± SEM from n = 4 independent experiments.

See also Figure S1.

“Spread ratio” was calculated from the number of infected cells
at 24 hr post-infection (hpi) relative to 8 hpi for each ISG (Figures
1A and S1A).
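
To illustrate why a low MOI leaves room for several rounds of spread, the following Python sketch estimates the fraction of cells infected in the first round. It assumes that infectious particles are distributed over cells according to a Poisson distribution, which is a common textbook approximation and not a statement taken from this study; the function names are hypothetical.

```python
import math


def fraction_infected(moi: float) -> float:
    """Expected fraction of cells receiving at least one infectious particle."""
    return 1.0 - math.exp(-moi)


def fraction_multiply_infected(moi: float) -> float:
    """Expected fraction of cells receiving two or more infectious particles."""
    return 1.0 - math.exp(-moi) * (1.0 + moi)


moi = 0.01  # MOI used for the screen, as stated in the text
print(f"infected in the first round: {fraction_infected(moi):.2%}")
print(f"multiply infected:           {fraction_multiply_infected(moi):.4%}")
```

Under this assumption only about one cell in a hundred is infected initially, so most infected cells counted at the later time point would stem from subsequent rounds of replication.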

The screen was performed twice, using independently generated
lentivirus libraries (Figure 1B). α-HA antibody, with a spread
ratio of ~2, was a positive control, whereas empty vector controls
had a spread ratio of 50 to 60 (Figures S1C and 1B). Nineteen
ISGs reduced the IAV spread ratio to <20, more than
two SDs below the empty vector control in both screens (Figure 1B
and Table S2). Among these ISGs were several broadly acting
antiviral factors involved in pattern recognition and IFN signaling,
such as DDX58 (RIG-I), IFIH1 (MDA5), IRF2, IRF7, IL28RA
(IFNLR1), inflammatory cytokines, CCL5 (RANTES), and broadly
acting or IAV-specific inhibitors, such as IFITM3, Mx1, and BST2
(Schneider et al., 2014). IFITM3 and Mx1 act early (IAV entry or
replication), whereas BST2, also known as tetherin, prevents 
release of budding virus particles at the host cell surface. 
Although conflicting data exist on IAV inhibition by tetherin (Mangeat
et al., 2012; Watanabe et al., 2011; Yondola et al., 2011), it
potently inhibited spread in A549 cells and was subsequently
used as a positive control. We also identified a number of ISGs
with previously uncharacterized antiviral activities: MAP3K14,
ELF1, PMM2, FAM46C, TBX3, SCO2, CRY1, TNFSF10 (TRAIL),
XAF1, MAFF, and SERPINE1 (serine protease inhibitor, member
E1). We validated this set of genes with independently generated,
high-titer lentiviral stocks and A549 cells, as well as normal
human bronchial epithelial cells (NHBE). All but FAM46C, which 
did not inhibit virus spread on NHBE cells, were confirmed as 
antiviral (Figure 1C). To exclude false positives due to cytotox- 
icity, we tested cell proliferation, apoptosis, DNA damage, and 
cell death profiles by high-throughput microscopy (HTM) (Figure S1D).
We found that only expression of MAP3K14, the
most potent hit in our screen, and TNFSF10 were cytotoxic rela- 
tive to the empty vector control. 
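
The spread ratio and the hit criterion described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the threshold is taken here as the mean of the empty vector controls minus two standard deviations, computed per screen, which is one reading of the text and not necessarily the exact calculation used in the study. All function names and the numbers in the example are hypothetical.

```python
from statistics import mean, stdev


def spread_ratio(infected_24hpi: float, infected_8hpi: float) -> float:
    """Ratio of infected cells at 24 hpi to infected cells at 8 hpi."""
    if infected_8hpi <= 0:
        raise ValueError("need a positive infected-cell count at 8 hpi")
    return infected_24hpi / infected_8hpi


def hit_threshold(control_ratios: list[float], n_sd: float = 2.0) -> float:
    """Assumed cutoff: control mean minus n_sd sample standard deviations."""
    return mean(control_ratios) - n_sd * stdev(control_ratios)


def is_hit(isg_ratios: list[float], controls: list[list[float]]) -> bool:
    """An ISG counts as a hit only if it falls below the cutoff in every screen."""
    return all(r < hit_threshold(c) for r, c in zip(isg_ratios, controls))


# Hypothetical empty vector spread ratios for two independent screens.
screen_controls = [[50.0, 55.0, 60.0, 52.0, 58.0], [51.0, 57.0, 59.0, 53.0, 56.0]]

# Hypothetical ISG with 400 infected cells at 24 hpi and 40 at 8 hpi in each screen.
ratios = [spread_ratio(400, 40), spread_ratio(400, 40)]
print(ratios, is_hit(ratios, screen_controls))
```

Requiring the cutoff to be met in both screens mirrors the use of two independently generated lentivirus libraries and reduces the chance that a single noisy well is scored as antiviral.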

Because protease inhibitors have been used clinically to treat 
other viruses (e.g., HIV), an endogenous effector with a similar 
function was a promising lead. We therefore focused on exploring 
the antiviral action of SERPINE1, encoding PAI-1. SERPINE1 
expression inhibited spread of various clinical IAV isolates, including
a derivative of a highly pathogenic avian H5 influenza virus,
modified to remove the polybasic cleavage site in the viral hemagglutinin
(Steel et al., 2009), A/Vietnam/1203/2004(HALo) (H5N1),
the pandemic A/California/04/2009 (H1N1), and an isolate of swine
origin, A/sw/Texas/4199-2/1998 (H3N2) (Figure 1D). In multi-step
viral growth kinetics, SERPINE1 expression reduced extracellular
IAV WSN/33 titers ~10-fold, comparable to inhibition by tetherin
(Figure 1E).

This versatile SERPIN family member has been implicated in 
many physiological processes, including regulation of fibrino- 
lysis (reviewed in Declerck and Gils, 2013). However, since an 
antiviral effector function of PAI-1 protein in the context of the 
intrinsic immune response is novel, we set out to determine its 
role in restricting IAV infection. 

IAV Infection Enhances Secretion of PAI-1, which Is Both
Necessary and Sufficient for IAV Inhibition 

We first studied the kinetics of SERPINE1 gene expression, as 
well as PAI-1 protein production and secretion. We compared
A549 cells and the more relevant in vitro model of NHBE-derived,
differentiated human ciliated airway epithelium cultures (HAEC), 
which mimic both the morphology and physiology of the airway 
epithelium in vivo. In A549 cells, SERPINE1 mRNA was slightly 
upregulated upon IFN-β stimulation and following infection
with IAV WSN/33 (Figure 2A). This was not due to nonresponsiveness
of A549 cells, since other ISGs were highly upregulated
(Figures S2A-S2C). TGF-β is known to trigger SERPINE1
expression via the canonical Wnt/β-catenin pathway (He et al.,
2010). Indeed, TGF-β treatment of A549 cells strongly induced
SERPINE1 expression with no or modest effects on ISG15,
IFITM3, and BST2 mRNA levels (Figures S2A-S2D). Stimulation
of SERPINE1 gene expression led to increased intracellular and
extracellular levels of PAI-1 (Figures 2B, 2C, S2E, and S2F).
Consistent with PAI-1 being efficiently secreted, total PAI-1
levels in the supernatant were about 16-fold higher than in
respective IFN-β-treated cell lysates at 24 hr (Figures 2B and
2C). We observed apical secretion of PAI-1 by HAEC after either
IAV WSN/33 infection (Figure 2D) or TGF-β treatment (Figure S2G).
Of note, even mock-treated A549 cells and HAEC
constantly produced and secreted basal levels of PAI-1 that 
accumulated over time (Figures 2B-2D). However, PAI-1 is 
further upregulated by certain stimuli, including virus infection. 

To test possible IAV inhibition by extracellular PAI-1, we added
recombinant active PAI-1 (rPAI-1) to the supernatant of A549
cells during IAV infection. rPAI-1 decreased IAV WSN/33 spread
in a dose-dependent manner (Figure 2E). Conversely, we used a
polyclonal α-PAI-1 antibody that targets the α-helix F of PAI-1 to
neutralize PAI-1’s ability to inhibit proteases (Komissarov et al.,
2005). Strikingly, we found that addition of α-PAI-1 antibody to
A549 cells during infection significantly enhanced IAV spread
compared to the IgG control (Figure 2E).

Thus far, the extracellular environment in our experiments was 
defined by components of the growth medium or by proteins 
secreted by the cultured cells (endogenous or overexpressed). 
To test the potency of PAI-1 in a more natural setting, we exam- 
ined IAV growth kinetics on HAEC in the presence or absence of 
rPAI-1. Strikingly, addition of rPAI-1 to the apical side of HAEC
significantly reduced IAV growth compared to carrier control,
with ~10-fold lower infectivity at 48 hpi (Figure 2F). In contrast,
addition of α-PAI-1 antibody dramatically enhanced IAV growth
as early as 12 hpi and continued to do so throughout the course of infection.
In both A549 cells and HAEC, human parainfluenzavirus 3
(HPIV3) was unaffected by either of these treatments, showing 
that the effect was selective for IAV and not due to cytotoxicity 
(Figures S2H and S2I). 

Experiments probing which stage of the viral life cycle was 
affected by PAI-1 showed no effects on early stages, replication, 
or egress (Figures S3A-S3E). However, a reduction of progeny 
particle infectivity (Figure S3F), together with the extracellular 
presence of PAI-1 (Figure 2) and its well-described function as a protease
inhibitor (Declerck and Gils, 2013), led us to investigate this
activity as the mechanistic basis of PAI-1-mediated IAV inhibition.

PAI-1 Targets Airway Proteases Needed 
for Extracellular IAV Maturation 

Infectivity of IAV progeny particles requires a maturation cleav- 
age of the viral hemagglutinin (HA0 to HA1 and HA2), catalyzed 

Figure 2. SERPINE1 Gene Expression Profiles and the Role of Extracellular PAI-1
Protein for IAV Inhibition

(A-C) A549 cells were infected with IAV WSN/33, 
treated with IFN-β, or mock-treated. (A) SERPINE1
mRNA expression was normalized relative to 
housekeeping gene RPS-11. Fold increase over 
pre-treatment control levels is shown. Accumulated 
total (B) intracellular and (C) extracellular protein 
levels of PAI-1 were determined by ELISA. Data are 
represented as mean ± SEM from n = 3 experi- 
ments. 

(D) HAEC infected with IAV WSN/33 or mock 
treated. Accumulated PAI-1 protein in repeated 
apical washes was determined by ELISA. Data are 
shown as mean ± SEM from n = 3 replicates. 

(E) A549 cells were infected with IAV WSN/33 in the 
presence of either rPAI-1 (1) or α-PAI-1 antibody (2),
and virus spread was assayed by HTM at 48 hpi. 
The number of infected cells was normalized to 
buffer (1) or IgG (2). Data are represented as mean ± 
SEM from n = 4 independent experiments. Statis- 
tical significance relative to empty control was 
determined by t tests. 

(F) HAECs were infected with IAV WSN/33 in the 
presence of either rPAI-1 (1) or α-PAI-1 (2) and buffer
(1) or IgG (2). Progeny virus was collected from
apical washes, and rPAI-1 or α-PAI-1 were replenished
after each wash. Virus titers were determined
by plaque assay. Data are shown as mean ±
SEM from n = 3 replicates.

See also Figure S2. 



by host proteases (Lazarowitz and Choppin, 1975; Skehel and
Waterfield, 1975). We tested whether known HA-cleaving proteases
might be direct targets for PAI-1 inhibition. PAI-1 inhibits
protease function by forming a covalent, SDS-stable bond with 
the target protease, detectable in gel shift assays. Urokinase 
plasminogen activator (uPA), a major target of PAI-1, formed a
complex of ~85 kDa with rPAI-1, but not in the presence of the
PAI-1 inhibitor triplaxtinin (Figure 3A). TPCK-trypsin is another 
established target of PAI-1 and a prototype chymotrypsin-like 
serine protease (Olson et al., 2001). To allow multiple rounds 
of IAV infection in cells lacking endogenous HA-cleaving pro- 
teases, such as A549, trypsin is typically added in low con- 
centrations to the tissue culture medium. We readily detected 
PAI-1-trypsin complexes formed in vitro and their degradation
products, which were again absent when triplaxtinin was added 
(Figure 3B). Furthermore, we identified human tryptase and hu- 
man airway trypsin (HAT) as PAI-1 targets (Figures 3C and 3D). 
In contrast, we were unable to detect complexes of PAI-1 and 
furin, although this interaction has been previously reported (Fig- 
ure S4) (Bernot et al., 2011). 

The use of host proteases for maturation is not unique to IAV. 
Paramyxoviruses also use airway proteases to cleave their sur- 
face fusion (F) glycoprotein, and Sendai virus (SeV) utilizes the 
same extracellular airway proteases as IAV (Tashiro et al., 
1992). In contrast, HPIV3 F is cleaved intracellularly by ubiqui- 
tous endoproteases such as furin. Using the spread assay and 
trypsin as protease, we found that HPIV3 was not inhibited by 
PAI-1, whereas IAV WSN/33 and SeV both exhibited reduced
spread (Figures 3E-3G). SERPINE1* encodes a catalytically
inactive PAI-1 due to a single point mutation in the reactive cen- 
ter loop and served as a loss-of-function negative control (Law- 
rence et al., 1994). These data clearly demonstrate that PAI-1 
targets trypsin-like airway proteases needed for extracellular 
virus maturation. 

PAI-1 Prevents HA Cleavage of IAV Progeny Particles 

Next, we examined the efficiency of HA cleavage in the pres- 
ence of PAI-1 and trypsin. We found that rPAI-1 completely 
blocked trypsin-mediated cleavage of HA0 into HA1 and HA2 
(Figure 4A). Furthermore, expression of wild-type PAI-1, but 
not the inactive mutant, reduced the specific infectivity of IAV 
Puerto Rico/8/34 grown in the presence of trypsin (Figure 4B). 
We found similar results for the extracellular airway proteases 
transmembrane protease serine 2 (TMPRSS2) and HAT, both 
known HA-cleaving proteases (Hatesuer et al., 2013; Fig- 
ure 4C). Treatment with additional trypsin prior to virus titration 
restored specific infectivity to the level of virus grown in the 
presence of either protease, suggesting that the defect in spe- 
cific infectivity was indeed caused by uncleaved HA (Figures 
4B and 4C). 

We next investigated cleavage efficiency of HA from 
different clades: H1 of A/Puerto Rico/8/34, H2 of A/Japan/305/57,
and wild-type H5 of highly pathogenic A/Vietnam/1203/2004
(Figures 4D and 4E). Both H1 PR8 and H2 Japan
were uncleaved at baseline but were readily cleaved by either 
exogenous trypsin or overexpressed TMPRSS2 (Figure 4D). 



634 Cell 160, 631-643, February 12, 2015 ©2015 Elsevier Inc. 




[Figure 3A-3D, gel images: lanes combine uPA, trypsin, tryptase, or HAT with rPAI-1, with or without triplaxtinin; labeled bands include PAI-1, tryptase, and PAI-1/tryptase complexes.]




Figure 3. Protease Targets of PAI-1

(A-D) Representative gel shift assays to show
complexes of recombinant PAI-1 (rPAI-1) and
indicated proteases. 1 μg of rPAI-1 was combined
with increasing amounts of protease, and the
mixture was separated on SDS gels followed by
silver staining. Where indicated, triplaxtinin was
used to inhibit PAI-1 activity.

(E-G) HTM spread assay in the presence of PAI-1
wild-type (SERPINE1), loss-of-function mutant
(SERPINE1*), and controls for HPIV3-GFP (E), IAV 
WSN/33 (F), or SeV-GFP (G). Data are shown as 
mean ± SEM from at least n = 6 replicates from 2 
independent experiments. One-way ANOVA and 
Kruskal-Wallis post-test. 

See also Figure S4. 



Taken together, these data show that 
PAI-1 inhibits IAV spread by reducing 
extracellular cleavage maturation of pro- 
geny particles and that this mechanism 
operates in a physiologically relevant 
model of airway epithelium. 

The slight differences in HA1 size between trypsin- and 
TMPRSS2-treated HA have previously been shown to be 
caused by altered glycosylation patterns in TMPRSS2-overexpressing 
cells (Bertram et al., 2010a). Further, we found that 
wild-type but not mutant PAI-1 inhibited TMPRSS2-mediated 
cleavage of both HA subtypes. In contrast, cleavage of 
H5 Vietnam, which is mediated by endogenous intracellular 
proteases such as furin, was not impaired by PAI-1 (Figure 4E). 
This result is in line with data from PAI-1-furin gel shift assays 
(Figure S4) and from experiments with furin-dependent HPIV3 
(Figures S2H and S2I), further suggesting that furin is indeed 
not inhibited by PAI-1. 

Finally, we examined the cleavage state of HA on HAEC. These 
cultures produce and secrete intrinsic airway proteases on their 
apical side; in fact, they have previously enabled identification of 
several proteases that cleave HA (Bottcher et al., 2006). Addition 
of rPAI-1 to HAEC during IAV WSN/33 infection dramatically 
reduced HA cleavage, whereas addition of α-PAI-1 increased 
cleavage (Figure 4F). These findings are in agreement with our 
previous results demonstrating virus growth kinetics in HAEC 
(Figure 2F). 



Serpine1-/- Mice Exhibit Enhanced IAV Infection and More Severe 
Disease Pathology 

To examine the role of PAI-1 during IAV infection in vivo, we 
infected Serpine1-/- or Serpine1+/+ (B6) control mice intranasally 
with IAVs Puerto Rico/8/34, WSN/33, or vesicular stomatitis virus 
(VSV) as a control. Serpine1 was transcriptionally upregulated in 
B6 mouse lungs upon IAV infection (Figure 5B). We also detected 
increasing levels of murine PAI-1 (mPAI-1) in mouse lungs over the 
course of IAV infection with a 10-fold increase at 5 dpi compared to 
PBS-treated B6 (Figure 5C). Both Serpine1 mRNA and mPAI-1 
protein were undetectable in Serpine1-/- lung homogenates. 
IAV-infected Serpine1-/- mice exhibited significant weight loss 
and succumbed to infection on average 1 day earlier than B6 
parental mice (Figures 5D, 5E, 5G, and 5H). This difference was 
not observed in mice infected with VSV control (Figure S5A). 
Despite the modest difference in survival, lung IAV titers at late 
times of infection were significantly elevated in Serpine1-/- mice 
(Figures 5F and 5I), which correlated with the levels of mPAI-1. 
Keller et al. (2006) used Serpine1-/- mice to study the effect of 
influenza virus infection on thrombosis, and they found no impact 
of PAI-1 on IAV infection, as measured by the amount of viral 
RNA at 4 dpi. However, given the role of PAI-1 in blocking 
viral maturation, the levels of infectious virus particles (Figures 
5F and 5I) are a more relevant measure. Serpine1-/+ mice 
exhibited an intermediate phenotype with respect to weight loss, 
survival, and murine lung PAI-1 levels, indicating a co-dominant 
effect of PAI-1 on IAV infection (Figure S5B). 

Serpine1-/- lungs were increased in size at 5 days post-IAV 
infection, with enhanced necrosis compared to lungs from B6 

Figure 4. Effect of PAI-1 on Maturation of IAV HA 

(A, D, and E) BHK cells were transfected in serum-free MEM to express HA of different origins. Whole-cell lysates were analyzed by western blot. HA cleavage 
products HA1 and HA2 are indicated, and relative band intensities are shown at the bottom of the gel. 

(A) HA of A/Puerto Rico/8/34 origin. Shown are cells treated with TPCK-trypsin, TPCK-trypsin and rPAI-1, or buffer. 

(B) BHK cells were transfected to express GFP as negative control, PAI-1 wild-type (SER), or loss-of-function mutant (SER*) and then challenged with IAV Puerto 
Rico/8/1934 in the presence of TPCK-trypsin. Supernatants were harvested and assayed for infectivity by focus forming assay on MDCK cells and for viral RNA by 
qRT-PCR. Plaque assays were performed in duplicate with or without additional post-harvest incubation with TPCK-trypsin. Specific infectivity was determined 
by calculating the ratio of FFU to vRNA copies, and values were normalized to GFP control. Data are shown as mean ± SEM from n = 6 replicates from 2 in- 
dependent experiments. 

controls (Figure 5A). Bleeding into alveoli was slightly enhanced 
in infected Serpine1-/- lungs at 4 dpi, which might be expected 
given PAI-1's role in fibrinolysis (Figure S5C). Cytokine expression 
was elevated in IAV-infected Serpine1-/- lungs compared 
to B6 control lungs, indicating increased inflammation (Figure S5D). 
In contrast, cytokine levels in uninfected lungs or in 
lungs from mice intranasally challenged with poly(I:C) were 
comparable between genotypes (Figure S5E). Thus, the increased 
weight loss and death observed in Serpine1-/- mice during 
IAV infection cannot be simply attributed to a global upregulation 
of cytokines. Whereas cytokine levels began to decrease in 
IAV-infected B6 lungs at 5 dpi, suggesting a subsiding infection, they 
stagnated in Serpine1-/- lungs, indicating an ongoing infection 
in Serpine1-/- mice. This correlated with the presence of higher 
IAV titers in Serpine1-/- mice at this late time point. 

To further confirm the in vivo phenotype, we generated murine 
tracheal epithelial cultures (MTEC) from the trachea of 
Serpine1-/- or B6 wild-type mice, similar to HAEC. After challenge 
with A/X-31(H3N2), we found significantly increased 
vRNA and infectious virus in the Serpine1-/- cultures at later 
times of infection (Figure 5J). These results strengthen the 
physiological relevance of Serpine1 as a regulator of IAV spread 
and a component of the host barrier that restricts IAV infection 
in vivo. 

Natural PAI-1 Deficiency in Human Fibroblasts Is 
Correlated with Increased Spread of IAV 

Several single nucleotide polymorphisms (SNPs), often linked to 
bleeding disorders, have been described for SERPINE1 (Fay 
et al., 1997; Zhang et al., 2005). SNP rs6092 is located in the 
signal peptide sequence of SERPINE1 (c.A43T), resulting in sub- 
stitution of alanine by threonine (PAI-1 p.A15T) and partial 
intracellular retention of PAI-1. In fact, a heterozygous rs6092 carrier 
had 30% reduced serum PAI-1 and a tendency to hemorrhage 
(Zhang et al., 2005). We characterized IAV susceptibility in three 
human fibroblast lines with at least one rs6092 allele. Two of 
these are heterozygous for SERPINE1 c.A43T (designated as 
T/A 4 and 5, Figure 6), and one is homozygous (A/A 6). We 
compared them to three control lines, all encoding wild-type 
PAI-1 (T/T 1-3). 

We found that IAV WSN/33 spread was significantly enhanced 
in the three c.A43T fibroblast lines compared to controls (Figures 
6A and 6B). This was not due to enhanced replication, at least for 
T/A 5 and A/A 6, because the number of infected cells during one 
round of replication was similar to controls (Figure 6C). In 
multicycle growth experiments, we observed some variation between 
control lines, which is common, as donors are genetically 
heterogeneous. However, we found that each c.A43T fibroblast line 
produced significantly more infectious virus compared to each 
of the three controls (Figure 6D). 



To determine whether this phenotype was selective for IAV, 
we challenged the fibroblasts with HPIV3. Indeed, two of our 
three c.A43T fibroblast lines, T/A 5 and A/A 6, did not support 
increased spread or replication of HPIV3, as determined by 
HTM or in multicycle growth assays (Figures 6E-6G). However, 
T/A 4, in which we had previously observed enhanced IAV 
replication (Figure 6C), exhibited both enhanced HPIV3 spread 
and replication compared to controls, suggesting that this cell 
line may have other properties unrelated to SERPINE1 that 
promote virus replication and spread. 

Extracellular PAI-1 levels of c.A43T fibroblast cultures were 
about 50% that of controls, which was consistent with previous 
reports (Figure 6H; Zhang et al., 2005). We next attempted to 
rescue IAV spread inhibition by adding rPAI-1 to the culture 
medium of T/T 1, T/A 5, and A/A 6 cells during IAV WSN/33 
infection. rPAI-1 dramatically reduced virus spread in T/A 5 and A/A 
6 cells down to the level of T/T 1 control cells in a dose-dependent 
manner. In contrast, control STAT1-/- fibroblasts, which 
support enhanced spread but neither harbor c.A43T nor have 
reduced extracellular PAI-1 levels, could not be rescued to the 
level of T/T 1 controls (Figure 6I). 

These results show that a natural extracellular PAI-1 defi- 
ciency correlates with increased IAV spread. 

DISCUSSION 

We identified the serine protease inhibitor PAI-1 as an ISG that 
restricted IAV spread, and we characterized its previously unde- 
scribed antiviral function. PAI-1 is a 50 kDa glycoprotein and the 
main physiological inhibitor of urokinase/tissue plasminogen 
activators (uPA/tPA), both major regulators of the fibrinolytic sys- 
tem. uPA/tPA convert the zymogen plasminogen into plasmin, 
triggering a proteolytic cascade to dissolve blood clots (re- 
viewed in Declerck and Gils, 2013). PAI-1 is present in plasma, 
platelets, and the extracellular matrix and is secreted by endo- 
thelial, smooth muscle, and immune cells in different tissues, 
including the airway. Other than uPA/tPA, PAI-1 inhibits multiple 
serine proteases of the chymotrypsin type, with varying effi- 
ciencies (Irving et al., 2000). 

Here, we find three new PAI-1 protease targets: human 
tryptase (tryptase Clara; club cell secretory protein), HAT, 
and TMPRSS2, all of which are involved in extracellular IAV glyco- 
protein cleavage. This finding suggests a role for PAI-1 as an anti- 
viral factor by targeting extracellular maturation of IAV particles. 

IAV maturation involves cleavage of its surface glycoprotein 
HA into HA1 and HA2. These proteins remain linked via a disul- 
fide bond (Skehel and Waterfield, 1975), which forms a hinge, 
a prerequisite for viral envelope fusion during entry (Klenk 
et al., 1975; Lazarowitz and Choppin, 1975). Because IAV does 
not encode its own protease, HA cleavage depends on the 



(C) BHK cells were transfected to co-express GFP as negative control, PAI-1 wild-type (SER), or loss-of-function mutant (SER*), and TMPRSS2 or HAT as bait 
protease. Cells were then challenged, and virus infectivity was analyzed as described in (B). 

(D) HA cleavage assay with H1 of A/Puerto Rico/8/34 or H2 of A/Japan/305/57 origin in the presence of protease TMPRSS2 and wild-type (SER) or 
loss-of-function (SER*) PAI-1. 

(E) HA cleavage assay with H5 of A/Vietnam/1203/2004 origin and wild-type (SER) PAI-1, or GFP as a negative control. 

(F) HA cleavage assay on HAEC infected with IAV WSN/33 in the presence of either rPAI-1 (1) or α-PAI-1 (2) and buffer (1) or IgG (2). Progeny virus particles from 
apical washes at 24 hpi were analyzed. 

Figure 5. Role of Serpine1 in Lethal IAV Infection in Mice 

(A-F) Wild-type (B6) or Serpine1-/- mice were infected intranasally with 36 LD50 of IAV Puerto Rico/8/34. Statistical significance was determined by individual 
t tests (D, F, G, and I) or log rank test (E and H). 

(A) Representative images of infected lungs 5 dpi. 

(B and C) Homogenates of infected mouse lungs were assayed for Serpine1 mRNA levels by qRT-PCR (B) and mPAI-1 protein levels by ELISA (C). Data are shown 
as mean ± SEM from n = 5 mice per group. 

(D and E) Weight loss (D) and survival (E) of infected mice. Data are shown as mean ± SEM from n = 25 infected B6 or Serpine1-/- mice, and n = 3 respective PBS 
control animals. 

(F) Homogenates of infected lungs were assayed for IAV titers by focus forming assay on MDCK cells. Data are shown as mean ± SEM from n = 17 B6 and 11 
Serpine1-/- mice per day. 

(G-I) Wild-type (B6) or Serpine1-/- mice were infected intranasally with 36 LD50 of IAV WSN/33 and monitored for weight loss (G) and survival (H). Data are shown 
as mean ± SEM from n = 32 infected B6 or Serpine1-/- mice, and n = 3 respective PBS control animals. IAV lung titers (I) shown as mean ± SEM from n = 9 B6 or 
Serpine1-/- mice per day. 




presence of host proteases at the site of IAV replication. Furin 
and PC5/6, members of the subtilisin-like family of serine 
proteases, are known to cleave HA intracellularly (Horimoto et al., 
1994; Stieneke-Grober et al., 1992). We could not find evidence 
for inhibition of furin by PAI-1. Human tryptase, HAT, TMPRSS2, 
TMPRSS13, mosaic serine protease large-form (MSPL), matrix 
metalloproteinase 9 (MMP9), and plasmin are known extracellular 
HA-cleaving proteases (Lazarowitz et al., 1973; reviewed 
in Bertram et al., 2010b). Most are members of the 
chymotrypsin-like serine protease family, the “target family” of PAI-1. 
Here, we provide direct proof that PAI-1 efficiently inhibits 
trypsin- and TMPRSS2-mediated cleavage of HA, and we 
hypothesize that PAI-1 might inhibit other members of this 
protease family as well. 

Accessibility to glycoprotein cleavage is a major determinant 
of IAV pathogenicity (Bosch et al., 1981). Whether, and at what 
efficiency, an HA subtype can be cleaved by a given protease 
is determined by its cleavage site sequence. Highly pathogenic 
avian H5 and H7 HAs contain a multi-basic cleavage site, which 
makes them accessible to ubiquitously expressed, intracellular 
proteases like furin (Bosch et al., 1981). Hence, viral particles 
of these subtypes are immediately mature and infectious upon 
budding. Less pathogenic IAV HAs, in contrast, contain a single 
arginine residue in their cleavage site, which makes them depen- 
dent on cleavage by extracellular airway proteases, such as 
TMPRSS2. Variation in the amino acid sequence surrounding 
this arginine alters cleavage efficiency (Galloway et al., 2013). 
Recent studies in TMPRSS2-/- mice revealed a direct influence 
on pathogenicity of IAV strains relying on this protease (Hatesuer 
et al., 2013; Sakai et al., 2014; Tarnow et al., 2014). In the 
absence of TMPRSS2, IAV of H1N1, H3N2, or H7N9 origin 
were unable to spread from the trachea to the lungs, making 
the mice highly resistant. This was not the case for highly 
pathogenic H5N1 virus (Sakai et al., 2014). Thus, sensitivity to 
PAI-1-mediated inhibition (or inhibition by other protease inhibitors) 
will depend on the nature of the cleaving protease, as well as 
on the cleavage efficiency of a specific HA subtype by that pro- 
tease (Figure 5). In this regard, it will be interesting to determine 
whether the protease inhibitor PAI-1 has differential impact on 
pathogenicity of different IAV subtypes in vivo. 
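
The distinction drawn above between multibasic and monobasic cleavage sites can be sketched as a simple sequence check. This is an illustrative assumption, not a method used in the study: the pattern R-X-(K/R)-R is the commonly cited furin-type consensus, the function name is hypothetical, and the example strings in the usage lines are placeholders rather than sequences taken from this work.

```python
import re

# Furin-type consensus R-X-(K/R)-R at positions P4 to P1, i.e. the four
# residues immediately upstream of the cleavage point (assumed motif).
FURIN_MOTIF = re.compile(r"R.[KR]R$")


def cleavage_site_type(upstream_residues):
    """Classify the residues immediately upstream of the HA1/HA2 cleavage point."""
    seq = upstream_residues.upper()
    if FURIN_MOTIF.search(seq):
        return "multibasic"   # candidate for intracellular, furin-like cleavage
    if seq.endswith("R"):
        return "monobasic"    # single arginine; extracellular protease needed
    return "unclassified"


# Placeholder sequences for illustration only.
print(cleavage_site_type("PQRERRRKKR"))
print(cleavage_site_type("PSIQSR"))
```

A check of this kind only sorts sites into the two broad classes discussed in the text; it says nothing about the cleavage efficiency differences that arise from the residues surrounding a single arginine.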

Not surprisingly, HA cleavage has been proposed as a target 
for antiviral therapy. Synthetic inhibitors, such as morpholinos 
or peptide mimetic protease inhibitors, have been used to target 
single proteases (Bottcher-Friebertshauser et al., 2011). Leu- 
peptin of actinomyces origin, the synthetic drug camostat, or 
aprotinin from bovine lungs all have broader antiprotease activity 
(Beppu et al., 1997; Lee et al., 1996; Tashiro et al., 1987; Zhirnov 
et al., 2011). So does recombinant mucus protease inhibitor 
(MPI), the only previously known airway protease inhibitor of human 
origin with anti-IAV activity (Beppu et al., 1997). The efficacy 
of applying these molecules to restrict IAV has been demon- 
strated both in vitro and in vivo and validates HA cleavage inhibi- 
tion as an attractive antiviral strategy. Our data show that PAI-1 



employs this mechanism and is the first known host protease 
inhibitor that functions to protect the host during natural IAV 
infection. 

PAI-1 is a somewhat unconventional ISG, as it is constitutively 
expressed, but can be further upregulated by IFN and 
other cytokines like IL-6, IL-1, TGF-β, and TNF-α (Medcalf, 
2007). This is significant because HA-cleaving proteases are 
often upregulated by IAV infection, skewing the balance between 
proteases and protease inhibitors in favor of the virus (Kido et al., 2012). 
Thus, PAI-1 may tip the balance back toward the host. Its regulation 
by TGF-β might be particularly relevant during IAV infections 
because IAV neuraminidase can activate latent TGF-β 
(Carlson et al., 2010). Other sources of TGF-β in the airway during 
infection include secretion by macrophages and by epithelial 
cells. 

It is interesting that upregulation of SERPINE1 gene expression 
during infection is not as dramatic as for other ISGs, 
possibly reflecting PAI-1's critical role in other local and systemic 
physiological processes, where massive overproduction 
could be deleterious for blood clotting, cancer metastasis, or 
cell adhesion and migration (reviewed in Declerck and Gils, 
2013). PAI-1 is the only SERPIN that spontaneously, and 
quickly, adopts a latent form in vivo, which provides a regulatory 
mechanism, both spatially and temporally (Berkenpas 
et al., 1995). 

The respiratory tract is a major portal for virus entry into the 
body but is protected by a multilayered antiviral fence. Mucus 
acts as a physical extracellular barrier. Mucins and surfactant 
proteins in mucus trap and aggregate virus particles outside of 
cells or act as decoy receptors inhibiting early steps in the viral 
replication cycle (Hartshorn et al., 2006; Reading et al., 2008). 
Mucins are present in airways at constitutive levels but are upre- 
gulated during inflammation (Turner and Jones, 2009). We now 
add PAI-1 as an additional component of this extracellular bar- 
rier. PAI-1 is the first extracellular directly antiviral ISG, and its 
promiscuity suggests that it may also play an important role in 
host defense against other respiratory viruses that rely on this 
step in their life cycles. 

Many virus families require a maturation cleavage of viral 
surface glycoproteins, generally realized by serine proteases. 
Some use virus-encoded proteases, such as picornaviruses or 
retroviruses (Brody et al., 1992; Lee et al., 1993). Others, such 
as paramyxoviruses, orthomyxoviruses, coronaviruses, filovi- 
ruses, or arenaviruses, rely on host proteases (Glowacka 
et al., 2011; Lenz et al., 2001; Skehel and Waterfield, 1975; 
Steinhauer and Plemper, 2012; Volchkov et al., 1998). All are 
potentially sensitive to inhibition of their activating proteases 
by host or pharmacological protease inhibitors. Indeed, we 
show that a single inhibitor, PAI-1, significantly inhibits spread 
of IAV and SeV. Further exploration of the troika of virus, 
activating protease, and host inhibitor may open new avenues for 
antiviral intervention and deepen our understanding of virus 
maturation and pathogenicity. 



(J) MTECs cultured from B6 or Serpine1-/- mice were infected with A/X-31 (H3N2) at MOI = 10^-6. vRNA was quantified from lysed MTEC by qRT-PCR. In parallel, 
infectious virus titers were determined by TCID50 assay on MDCK cells. Data are shown as mean ± SEM from n = 3 B6 or Serpine1-/- MTECs. Statistical 
significance was determined by two-way ANOVA. 

See also Figure S5. 

Figure 6. Susceptibility of PAI-1-Deficient Human Fibroblasts to IAV 

(A-H) SV-40-immortalized human fibroblasts from different donors, wild-type (T/T 1-3), heterozygous (T/A 4 and 5), or homozygous (A/A 6) for SERPINE1 SNP 
rs6092 were infected with IAV WSN/33 (A-D, H, and I) or HPIV3-GFP (E-G). 

(A) Representative images from single wells infected with IAV WSN/33 at 48 hpi. 

(B) IAV spread shown as the ratio between number of infected cells at 12/48 hpi. 

(C) Percent of infected fibroblasts after one round of replication. Data in (A) and (B) are represented as mean ± SEM from n = 4 independent experiments, each in 
triplicate. 

(D) IAV WSN/33 growth on fibroblasts. PFU in the supernatants were determined by plaque assay on MDCK cells. Data are represented as mean ± SEM from n = 4 
independent experiments. Statistical significance was determined by individual t test, comparing the pooled SNP-carrying cells with the pooled control cells TT2 
and TT3 for every time point. 

(E) HPIV3-GFP spread shown as the ratio between number of infected cells at 48 and 24 hpi. 

(F) Percent of infected fibroblasts after one round of replication. Data in (E) and (F) are represented as mean ± SEM from n = 2 independent experiments, each in 
triplicate. 

(G) HPIV3-GFP growth on fibroblasts. TCID50 in supernatants were determined on LLC-MK2 cells. 

(H) Extracellular PAI-1 levels from IAV WSN/33-infected fibroblasts during spread assay at 48 hpi. Data are shown as mean ± SEM from total n = 8 replicates from 
n = 4 experiments. 

(I) Rescue of virus inhibition on selected human fibroblast lines by adding rPAI-1 to the cultures during IAV WSN/33 infection. Maximum addition of rPAI-1 (black 
bars) was 500 ng, minimum addition (light gray bars) 125 ng. Data are represented as mean ± SEM from n = 4 independent experiments, each in quadruplicate. 
One-way ANOVA and Dunnett’s multiple comparison post-test against T/T 1- values of the same condition. 

In conclusion, we identify SERPINE1/PAI-1 as a host factor 
inhibiting IAV spread and show that PAI-1 mechanistically acts by 
blocking maturation of progeny IAV particles, thus reducing par- 
ticle infectivity. Naturally occurring human genetic variations in 
PAI-1 impact the ability of IAV to spread from cell to cell, thus 
identifying PAI-1 as a candidate human gene that may influence 
susceptibility to IAV and the outcome of infection. In this regard, 
it will be of immediate interest to determine whether PAI-1 
deficiency might be linked with the severity of human IAV infec- 
tion. Finally, our work suggests that localized administration of 
PAI-1 to the respiratory tract might provide a new therapeutic ap- 
proach for treating IAV and other respiratory viruses that require 
extracellular protease-driven maturation. 

EXPERIMENTAL PROCEDURES 

High-Throughput Microscopy Screening of Host Factor Library 

Using Gateway technology, we transferred a previously published library of 
ISGs (Schoggins et al., 2011) from ORFEXPRESS entry clones into a new 
lentiviral expression vector (pSCRPSY), co-expressing TagRFP as a transduction 
control and puromycin resistance. A549 cells in 96-well plates were first 
transduced and then challenged with IAV WSN/33 at MOI 0.01. Due to varying 
transduction efficiency between the lentiviral stocks, the percentage of 
non-transduced cells was variable. IAV production from non-transduced cells 
masked potential inhibitory effects of ISGs and was eliminated by adding 
puromycin 1 hr prior to IAV infection. Lentiviral transduction was performed in 
duplicate plates for each ISG, where one sample was fixed after one round 
of replication (8 hpi), defining the number of initial producer cells, and the other 
after several rounds of replication (24 hpi). “Spread ratio” was calculated by 
dividing the number of IAV-infected cells at 24 hpi by the number at 8 hpi for 
each individual ISG, normalized to empty vector-transduced cells (Figures 1A, 
S1A, and S1B), and yielded results that were stable over a wide range of 
transduction efficiencies (Figure S1B). Infected cells were detected by staining 
for IAV nucleoprotein (NP) and HTM (Figures 1A, S1A, and S1B). 
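
The spread ratio described above can be written out as a short calculation. The sketch below is one reading of that description, assuming the per-ISG ratio (24 hpi over 8 hpi) is divided by the same ratio from empty vector-transduced cells; the function names and cell counts are hypothetical.

```python
def spread_ratio(infected_24hpi, infected_8hpi):
    """Infected cells after several rounds of replication per initial producer cell."""
    if infected_8hpi <= 0:
        raise ValueError("need at least one infected cell at 8 hpi")
    return infected_24hpi / infected_8hpi


def normalized_spread(isg_counts, vector_counts):
    """Spread ratio for one ISG relative to empty vector-transduced cells.

    Each argument is a tuple: (infected cells at 24 hpi, infected cells at 8 hpi).
    """
    return spread_ratio(*isg_counts) / spread_ratio(*vector_counts)


# Hypothetical cell counts, for illustration only.
empty_vector = (4000, 200)   # spread ratio 20
candidate_isg = (1000, 200)  # spread ratio 5
result = normalized_spread(candidate_isg, empty_vector)
print(f"normalized spread: {result:.2f}")
```

Dividing by the 8 hpi count corrects for differences in the number of initial producer cells, which is consistent with the statement that the measure is stable across transduction efficiencies.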

Mice 

Mice were housed in an AAALAC-accredited facility in accordance with the 
Guide for the Care and Use of Laboratory Animals. All procedures outlined 
in the study were approved by The Rockefeller University’s Institutional Animal 
Care and Use Committee. Wild-type (C57BL/6J) and Serpine1-/- 
(B6.129S2-Serpine1tm1Mlg/J) breeder pairs were purchased from Jackson Laboratory, 
and colonies were established in-house. Six- to ten-week-old mice of both 
sexes were used in IAV challenge, and 10- to 14-week-old mice were used 
for VSV experiments. 

Human Cell Lines 

Informed consent was obtained from human subjects according to local reg- 
ulations. Samples were subsequently transferred for experimental testing 
under respective ethics approvals at INSERM in France and at Rockefeller Uni- 
versity in the USA. All protocols involving the use of human tissue were re- 
viewed and exempted by The Rockefeller University Institutional Review 
Board. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, five 
figures, and three tables and can be found with this article online at 
http://dx.doi.org/10.1016/j.cell.2015.01.040. 

SUMMARY 

Transcription in eukaryotes produces a number of 
long noncoding RNAs (lncRNAs). Two of these, 
MALAT1 and Menβ, generate a tRNA-like small 
RNA in addition to the mature lncRNA. The stability 
of these tRNA-like small RNAs and bona fide tRNAs 
is monitored by the CCA-adding enzyme. Whereas 
CCA is added to stable tRNAs and tRNA-like tran- 
scripts, a second CCA repeat is added to certain un- 
stable transcripts to initiate their degradation. Here, 
we characterize how these two scenarios are dis- 
tinguished. Following the first CCA addition cycle, 
nucleotide binding to the active site triggers a clock- 
wise screw motion, producing torque on the RNA. 
This ejects stable RNAs, whereas unstable RNAs 
are refolded while bound to the enzyme and sub- 
jected to a second CCA catalytic cycle. Intriguingly, 
with the CCA-adding enzyme acting as a molecular 
vise, the RNAs proofread themselves through differ- 
ential responses to its interrogation between stable 
and unstable substrates. 

INTRODUCTION 

The CCA-adding enzyme adds the nucleotide triplet CCA to the 
3' end of all tRNAs (Deutscher, 1982), a step that is essential for 
tRNA aminoacylation (Sprinzl and Cramer, 1979) and correct 
tRNA positioning in the ribosome (Nissen et al., 2000). It is an 
intriguing member of the nucleotidyltransferase family, since it 
operates without a nucleic acid template and without transloca- 
tion along the tRNA (Shi et al., 1998a). Instead, the tRNA 3'-end 
progressively refolds during synthesis and nucleotide selectivity 
switches from cytidine to adenosine after the addition of two 
C-nucleotides (Tomita et al., 2006; Xiong and Steitz, 2004). In 
addition to acting upon newly synthesized tRNA transcripts, 
the CCA-adding enzyme is able to accurately repair partially 
degraded tRNA 3' ends. Interestingly, CCA-adding enzymes 
fall into two divergent classes with only the catalytic domains 
sharing significant homology. In class I enzymes (present in 
archaea), bound tRNA is involved in the selection of the correct 
nucleotides (Xiong and Steitz, 2004), whereas in class II enzymes 
(in eubacteria and eukaryotes), proper nucleotides are selected 
by protein only (Li et al., 2002; Tomita et al., 2004). 

Although the CCA-adding enzyme was long thought to terminate 
polymerization once a single CCA triplet had been added, 
recent work on tRNA-like small RNAs derived from long noncoding 
RNAs (lncRNAs) indicated this is not always the case (Sunwoo 
et al., 2009; Wilusz et al., 2011). Metastasis-associated lung 
adenocarcinoma transcript 1 (MALAT1) and Menβ (also known 
as NEAT1) are well-characterized long, nuclear-retained noncoding 
transcripts that are involved in cancer progression (Ji 
et al., 2003; Lin et al., 2007) and paraspeckle formation (Clemson 
et al., 2009; Imamura et al., 2014; Sunwoo et al., 2009), respectively. 
In addition, both of these lncRNAs contain sequences 
that mimic the tRNA fold and are processed by the canonical 
tRNA biogenesis machinery to generate small RNAs (Wilusz 
et al., 2008; Wilusz et al., 2011). Just as for bona fide tRNAs, 
the trinucleotide CCA is post-transcriptionally added to the 3'-end 
of MALAT1-associated small cytoplasmic RNA (mascRNA), 
the tRNA-like transcript originating from the 3'-end of MALAT1 
(Wilusz et al., 2008). In contrast, the Menβ tRNA-like small RNA 
was found to be subjected to CCACCA addition and efficiently 
degraded (Sunwoo et al., 2009; Wilusz et al., 2011). Unlike the 
acceptor stems of canonical tRNAs and mascRNA, the Menβ 
acceptor stem is destabilized through mismatches or wobble 
base pairs, somehow prompting the CCA-adding enzyme to 
repeat its catalytic cycle, thereby adding tandem CCA motifs 
(Wilusz et al., 2011). The CCACCA tail then serves as a degradation 
signal for the cellular RNA decay machinery. 

It is now clear that CCA-adding enzymes from all three king- 
doms of life survey the stability of their tRNA-type substrates 
and selectively add either CCA or CCACCA. For example, 
many bona fide tRNAs that are destabilized through mutations 
or the lack of proper modifications are subjected to CCACCA 
addition (Wilusz et al., 2011). The CCA-adding enzyme is thus 
not only critical for generating functional tRNAs, but also likely 
plays a universal and central role in tRNA and tRNA-like small 
RNA surveillance and quality control. 

Here, we investigated how the CCA-adding enzyme distin- 
guishes structurally stable from unstable RNAs so as to only 
mark unstable RNAs with CCACCA. We also unravel how a 
second CCA cycle can occur despite the enzyme possessing a 
unique mechanism that normally ensures a single cycle of CCA 
synthesis. We find that after the first CCA cycle, nucleotide bind- 
ing to the active site induces the enzyme to apply torque on 
the RNA. A clockwise screw motion of the enzyme’s catalytic 
domain leads to RNA compression and overwinding. This 
causes unstable RNAs to extrude a distinctively positioned bulge 
from the acceptor stem while still bound to the enzyme. The 
bulge does not perturb the double-helical nature of the substrate 
and all other structural determinants near the active site remain 
in place. The catalytic mechanism is thus preserved between 
the first and second cycles of CCA addition (Pan et al., 2010; 
Tomita et al., 2006; Xiong and Steitz, 2004). In total, we find 
that tandem CCA addition is not the result of a modified enzy- 
matic activity that is particular to unstable RNAs. Rather, it is a 
consequence of the natural activity of the CCA-adding enzyme 
on a substrate with increased conformational flexibility. By ex- 
ploiting the versatility of RNA structure, the CCA-adding enzyme 
is able to trigger the degradation of potentially detrimental small 
RNAs and tRNAs. 

RESULTS 

Catalysis Is Crucial for the CCA-Adding Enzyme 
to Detect Unstable RNAs 

Certain full-length tRNAs that are hypomodified or contain muta- 
tions that destabilize their acceptor stems have previously been 
shown to be subjected to CCACCA addition (Wilusz et al., 2011). 
Using the CCA-adding enzyme from Archaeoglobus fulgidus 
(AfCCA), we found that tRNA minihelices, which contain only 
the acceptor stem and TΨC stem-loop, are also efficiently subjected
to CCACCA addition when they have guanosines at the
first and second positions as well as a destabilized acceptor
stem by virtue of mismatches and GU wobbles (Figure S1A
available online). To understand how the CCA-adding enzyme
determines whether a substrate is to receive CCA or CCACCA, 
we first used an arginyl-tRNA TCG minihelix that contained a 
CA mismatch (C3-A70) in its acceptor stem due to a mutation 
at position 70 (Figure 1A). This unstable tRNA minihelix (miniUR)
was co-crystallized with AfCCA and the structure of the complex
was determined by molecular replacement and refined to 2.95 Å
resolution (Table S1).

The unstable minihelix is bound between the enzyme’s cata- 
lytic center, comprised of the head and neck domains, and its 
tail domain, which serves as a ruler to ensure that only tRNAs 
and tRNA-like transcripts are substrates for the enzyme (Figure 1A)
(Tomita et al., 2004; Xiong and Steitz, 2004). As observed
previously, the discriminator base (G73) at the RNA 3'-end extends
the stacking of the minihelix and is not rotated out of the
helical trajectory (Tomita et al., 2006). However, in contrast to
Tomita et al. (2006), which described helical distortion in the middle
of the minihelix, we find that the unstable tRNA minihelix
perfectly mimics full-length tRNA with its acceptor and TΨC
stems folding into a continuous A-type RNA helix. Furthermore,
the TΨC loop is in the same conformation as in full-length tRNA,
likely because two nucleotide fragments (derived from the 
mother liquor after nucleolytic cleavage) aid proper folding by 
mimicking nucleotides 17 and 18 of full-length arginyl-tRNA TCG
(Figure S1B). The C3-A70 mismatch is seamlessly incorporated
into the RNA helix and adenine and cytosine face each other,
albeit further apart, despite their hydrogen bonding incompatibility
(Figure 1B). This larger distance is thus not sufficient to noticeably
distort the bound minihelix and, therefore, unlikely to distort
a full-length tRNA.

To understand at which step the enzyme is able to detect 
unstable RNAs, we added CTP prior to co-crystallization. The 
enzyme catalyzed the addition of two C-nucleotides to the ar- 
ginyl-tRNA TCG minihelix, but diffraction was poor. Nevertheless, 
we found that using a minihelix already containing the first C 
nucleotide (C74) improved diffraction to 3.2 Å (miniUR-C+CTP)
(Table S1). In this case, the enzyme catalyzed the incorporation
of one additional C-nucleotide, resulting in a minihelix ending in
-CC. Strikingly, the addition of CTP resulted in a completely
remodeled tRNA acceptor stem (Figures 1C and 1D). The original
discriminator base G73 moved down by three positions to base 
pair with C3, resulting in a three-nucleotide bulge (A70-C72) 
protruding from the RNA helix. Apart from this bulge, which in- 
cludes the originally mismatched (A70) and wobble (U71) bases, 
the RNA conformation was virtually unchanged. Two new base 
pairs G1:C75 and G2:C74 were formed, resulting in a blunt end
close to the enzyme (Figure 1C). To determine whether full- 
length tRNA is able to form a bulge in the same position, we 
superimposed full-length tRNA (Protein Data Bank [PDB] code 
1SZ1) (Xiong and Steitz, 2004) onto the miniUR-C+CTP structure
(Figure S1C). The RNA bulge fits in the space between
the enzyme and the anticodon arm, strongly suggesting that 
an unstable full-length tRNA can form an RNA bulge in the 
same position. Unfortunately, in the miniUR-C+CTP structure, 
the enzyme was captured in a catalytically impaired state, as 
the incoming nucleotide is not properly positioned in the active 
site. This resembles a complex of the enzyme with either nucleotide
alone or with no RNA substrate (Figure S1D) (Tomita et al.,
2006; Xiong et al., 2003). Further addition of ATP to our
co-crystallization experiments did not yield an actively engaged
complex. Although CTP addition results in a catalytically competent
preinsertion complex when a stable substrate (denoted miniR) is 
used (PDB code 2DR5) (Tomita et al., 2006), we observed dramatic
refolding of the RNA after CTP addition to an unstable
substrate (Figure 1E). This clearly demonstrates that catalysis
is necessary for the CCA-adding enzyme to detect unstable 
RNAs. 

Clockwise RNA Overwinding and Compression upon 
Active Site Closure 

As we were unable to study the second CCA addition cycle by 
simply supplying nucleotides to the enzyme bound to unstable 
RNAs (Figures 1C and 1D), we incorporated CCAC at the 3'-end
of our unstable model tRNA minihelix (miniUR-CCAC). This
RNA was predicted to fold into a single species with a 3nt bulge 
(Zuker, 2003), reminiscent of our miniUR-C+CTP structure (Fig- 
ure 1C). In the absence of CTP, triclinic, plate-like crystals 
formed, and intensity statistics confirmed pseudo-merohedral 
twinning, accounted for by twin refinement in PHENIX (Adams 
et al., 2010). Addition of CTP (miniUR-CCAC+CTP) eliminated 
twinning and the crystals diffracted to 3.15 Å (Table S1). The
entire RNA is visible in both structures (Figures 2A and S2A).

Figure 1. Catalysis Is Crucial to Detect Unstable RNAs

(A) G70 of the arginyl-tRNA TCG minihelix was mutated to A, resulting in the miniUR transcript that has significant structural instability within its acceptor stem. The 
archaeal CCA-adding enzyme is shown in cartoon representation with the head domain in purple, the neck domain in green, the body domain in blue and the tail 
domain in cyan. Bound RNA is shown in atom colors with oxygens in red, nitrogens in blue, phosphorus in orange and carbons in light blue. 2Fo-Fc density for
the bound unstable RNA is contoured at 1σ throughout Figure 1.

(B) Close-up view of the acceptor stem. 

(C) Close-up view of the acceptor stem following CTP addition (miniUR-C+CTP). 

(D) MiniUR before and after CTP addition. Nucleotides labeled in black have similar positions in both structures, the mutation is boxed in red. 

(E) Comparison of the CCA-adding enzyme in complex with a wild-type tRNA minihelix (miniR) (PDB code 2DR5) (Tomita et al., 2006) or in complex with the
unstable miniUR minihelix after CTP addition. 

See also Figure S1.



Prior to nucleotide binding (miniUR-CCAC/open complex), the 
CCA-adding enzyme adopts an open, inactive conformation 
consistent with what we observed for the miniUR complex (Fig- 
ure 1A) and what is seen for stable RNAs (Tomita et al., 2006). 
Confirming the RNA folding prediction, a three-nucleotide bulge
(nucleotides 70-72) protrudes from the helix and does so in the
same location as seen after CTP addition to the unstable RNA
(miniUR-C+CTP) (Figures 1C and 1D).

CTP binding to the active site (miniUR-CCAC+CTP/preinser- 
tion complex) leads to closure of the head domain over the 
RNA double helix (Figure 2A), a process that was previously 
likened to the closure of a clam shell (Tomita et al., 2006). This
leads to compression of the RNA by an entire base-pair step
such that G1 of the preinsertion complex is located in the
same position as G2 of the open complex (miniUR-CCAC)
(Figure 2B; Movie S1). The average helical rise from G1:C72 through
C48:G64 decreases from 3.2 Å to 3 Å, accompanied by RNA
overwinding of 3° per base-pair step (Movies S1 and S2). Rather
than a simple clam-shell closure, our data suggest that active site
closure involves a clockwise screw motion of the entire head domain,
leading to RNA overwinding (Figure 2C; Movie S2). A similar degree of
overwinding was also observed when we reanalyzed active 
site closure over stable tRNA minihelices (Tomita et al., 2006). 
Although significant compression occurs in the acceptor stem, 
the TΨC arm of the minihelix is rather strikingly fixed and binds
the tail domain of the enzyme in a similar manner before and after 
CTP addition (Figure 2A). 
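
The per-step values reported above can be accumulated along the helix to get a feel for the overall geometric change. The following is only a minimal arithmetic sketch: the function name and the number of base-pair steps are illustrative assumptions, not values taken from the structures, and the calculation treats every step as changing uniformly.

```python
def stem_compression(rise_open, rise_closed, twist_gain_deg, n_steps):
    """Accumulate per-step changes over a helix.

    rise_open / rise_closed: average helical rise (in Å) before and after
    active site closure; twist_gain_deg: overwinding per base-pair step.
    n_steps is a hypothetical parameter supplied by the caller.
    Returns (total shortening in Å, total extra twist in degrees).
    """
    shortening = (rise_open - rise_closed) * n_steps
    extra_twist = twist_gain_deg * n_steps
    return shortening, extra_twist


# Rise (3.2 Å -> 3.0 Å) and overwinding (3° per step) are the values
# reported in the text; n_steps=11 is an assumed count for illustration.
shortening, extra_twist = stem_compression(3.2, 3.0, 3.0, n_steps=11)
print(f"{shortening:.1f} Å shorter, {extra_twist:.0f}° overwound")
```

Because the averages hide local variation, a real analysis would sum the rise and twist measured at each individual base-pair step rather than multiply a mean.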

Upon comparing the location of the RNA bulges in the cytosine 
preinsertion complex (miniUR-CCAC+CTP, Figure 2) with two 
different adenosine preinsertion complexes (miniUR-CCACC+ 
AMPcPP and miniUR-CCACC+CTP, Figure S2B), we found 
that the RNA was present in by and large the same conformation 
regardless of how many nucleotides were added at the 3' end 
(Figure 2D). However, the bulges exhibited varying degrees of 
disorder. A surface loop in the head domain (Loop1, residues
118-126) stabilizes nucleotide 72 (C72) through a weak hydrogen
bond between N4 of C72 and the carbonyl oxygen of Lys124. A
second surface loop (Loop2, residues 215-223) in the enzyme’s
neck domain is oriented toward, but does not bind, nucleotide 71.
As Loop1 only interacts with the RNA when it is present in the
bulged conformation, we reasoned that inserting additional residues
into Loop1 may sterically block bulge formation (thereby
blocking CCACCA addition), but not affect the first CCA addition
cycle. Three different insertions into Loop1 were tested: (1) insertion
of six negatively charged glutamates that are predicted to
repel the bulged RNA phosphates close to the enzyme, (2) insertion
of three glutamates, and (3) insertion of three large hydrophobic
residues (Phe-Leu-Trp) (Figure S2C). Although inserting
the hydrophobic residues (Loop1-3Hyd) had no effect on tandem
CCA addition, insertion of negatively charged glutamates
(Loop1-6Glu and Loop1-3Glu) completely blocked the second
CCA-addition cycle, whereas the first CCA triplet was readily
added (Figure 2E). As our model arginyl-tRNA TCG minihelix contains
the wobble base pair G2:U71, which may disproportionately
favor refolding, we confirmed that two additional minihelices
lacking this wobble pair gave identical results (Figure S2D).

We conclude that bulge formation is critical for tandem CCA 
addition and RNA surveillance of unstable RNAs. Furthermore, 
RNA compression and overwinding, induced by the head domain
acting as a “molecular screwdriver,” appear to be a way
for the CCA-adding enzyme to select for proper substrates by
challenging the stability of the substrates.

Menβ tRNA-like Small RNA Behaves Like an Unstable
tRNA and Is Held onto the CCA-Adding Enzyme through 
Ionic “Tweezers” 

Upon comparing cytosine selection and incorporation during the 
first and second CCA cycle, we noticed a strict conservation of 
the catalytic mechanism between the two cycles (Figure S3A). 
Since the same holds true for adenosine incorporation
(Figure S2B), we were surprised to observe a kinked minihelix in
the adenosine preinsertion complex of AfCCA bound to a human
Menβ-derived minihelix (miniMβ-CCACC+AMPcPP) (Figures 3A
and S3B). Despite diffraction to 2.6 Å (Table S1), the entire 3nt
RNA bulge and the adjacent base U54 are invisible. In addition,
G4 shows two distinct conformations (Figures 3A and 3B) at
the center of a clear 11° kink in the helical axis of the Menβ
minihelix (Figure 3B). Nevertheless, the two ends of the Menβ minihelix,
the three 5'-terminal “acceptor-stem” base pairs and the
“TΨC-arm” base pairs, superimpose perfectly with the equivalent
features of the unstable arginyl-tRNA TCG minihelix structure
(miniUR-CCACC+AMPcPP) (Figure 3B). Since the kink essentially
uncouples the acceptor stem from the TΨC arm, it seemed
that the tail domain may not actually be anchoring the RNA on 
the enzyme, as has been previously proposed (Cho and Weiner, 
2004; Tomita et al., 2006), during the second cycle of CCA 
addition. 

To gain insights into whether a kinked Menβ tRNA-like substrate
conforms to the normal CCA-adding enzyme mechanism,
we plotted the temperature-factor distribution of the entire
miniMβ-CCACC+AMPcPP complex (Figures 3C and S3C). Strikingly,
the most stable region of the entire complex is the active site cleft
and the adjacent top part of the acceptor stem. The tail domain
and the TΨC arm, on the other hand, are highly mobile. Analogous
results were obtained for canonical tRNA minihelices undergoing
the second CCA cycle (miniUR-CCACC+AMPcPP
and miniUR-CCACC+CTP), which all exhibit identical crystal
packing (Figure S3D). In contrast, the tail domain and the TΨC
arm are both stable during the first CCA addition cycle (Fig- 
ure S3E). Therefore, it appears that the tail domain does not 
serve as the major RNA anchor during the second CCA addition 
cycle. 

Why then is the top of the acceptor stem so stable? Upon 
calculating the electrostatic surface potential of the CCA-adding 
enzyme, we noticed two positively charged surface patches 
binding the top of the acceptor stem (Figure 3D). Patch1, consisting
of body domain residues Gln296, Arg299, Arg302, and
Lys402, binds the 5'-end of Menβ via direct ionic interactions
to bridging RNA phosphates. Patch2 is comprised of Arg129 
from the head domain and the catalytically essential Arg224 
from the neck domain. Interestingly, most contacts on this 3'- 
side are water-mediated and, therefore, weaker than on the 5'- 
side (Figure 3D). To address whether those ionic interactions 
keep the unstable RNA in place for the second CCA-addition 
cycle, we mutated both Arg299 and Arg302 to alanine and tested 
enzymatic activity in vitro (Figure 3E). While the wild-type enzyme
was able to extend the unstable arginyl-tRNA TCG minihelix
to CCACCA at a nucleotide concentration of 0.5-5 μM, the
mutant enzyme required 10-100× higher concentration before
switching from CCA to CCACCA addition (Figure 3E). These results
are consistent with Patch1 and Patch2 forming “ionic tweezers”
(Figure 3D, schematic) that ensure the second CCA cycle pro- 
ceeds efficiently. 

Apart from a kink in the acceptor stem helix, we conclude that 
tandem CCA addition proceeds in a highly similar fashion for ca- 
nonical tRNAs and tRNA-like small RNAs. Furthermore, it is clear 
that the tail domain of the CCA-adding enzyme has two separate 
functions. First, it serves as a ruler to ensure selective binding of
tRNA-type substrates. Second, it allows torque to be applied on
the helix upon active site closure and thereby contributes to the 
discrimination between stable and unstable RNA substrates. 

On-Enzyme RNA Refolding Permits the Second CCA 
Addition Cycle 

As the CCA-adding enzyme is renowned for its precision in ter- 
minating synthesis after a single CCA triplet, we were eager to 
determine how it transitions from the first to the second cycle of 
CCA addition. For unstable RNAs, we hypothesized that this 
transition may occur in one of two ways: (1) the RNA may 
dissociate from the enzyme after the first round of catalysis 
and re-bind in the bulged conformation to initiate the second 
cycle or, alternatively, (2) the RNA may refold while bound to 
the enzyme. 

Initial support for the latter model came from determining the 
structure of AfCCA in complex with the mutant arginyl-tRNA TCG 
minihelix ending in CCACCA that was bromouridine (BrU)-labeled
at nucleotides 54, 63, and 71 (miniUR-CCACCA) (Figure 4A). Two
strong anomalous peaks were detected at the expected positions
for BrU54 and BrU63 (Figure 4A). Surprisingly, we also
observed a weak anomalous peak for BrU71 near the top of the
acceptor stem. This result was unexpected since U71 falls within 
the location of the bulge, which was disordered in this structure. 
The only reasonable explanation is the presence of two alternative
RNA conformations, one representing the RNA prior to
refolding and one after refolding. Indeed, we were able to refine the
non-refolded state of the RNA to an occupancy of 30%, thereby
providing an explanation for the weak anomalous peak. The re- 
maining 70% is disordered in the bulge. Considering that the 
electron density for the tRNA 5'-end was unambiguous in this 
structure (as in all other structures of this study) and that the 
tRNA 5'-end is tightly bound by the enzyme (Figures 3C-3E), 
we hypothesized that the tRNA 3'-end refolded independently 
of the 5'-end to allow addition of the second CCA. 

To test this model, we used two different approaches: first by 
covalently tethering the RNA to the enzyme, and second using a 
competition experiment. For the first, we were inspired by earlier 
work on DNA (Huang et al., 1998) and we set out to reversibly 
crosslink the CCA-adding enzyme to the unstable tRNA using 
a site-specific link. We reasoned that if tandem CCA addition 
proceeded efficiently when the RNA substrate and CCA-adding 
enzyme were covalently joined, refolding most likely occurred 
“on-enzyme.” 



To engineer a covalent complex with a site-specific linkage, a 
purine nucleoside analog bearing a short linker terminating in a 
free thiol was incorporated at position 56 of the TΨC arm of stable
(WT) and unstable (mutant) arginyl-tRNA TCG minihelices
(Figure S4A) (Peacock et al., 2011). The TΨC arm seemed to be the
best location for not interfering with catalysis, as much of the rest 
of the minihelix is compressed prior to catalysis (this study and 
Tomita et al., 2006). Likewise, Asp351 and Arg344 of the AfCCA 
tail domain are located closest to the crosslinkable nucleotide 
and were therefore replaced with cysteines to allow a protein- 
RNA disulfide crosslink to form, a prediction we confirmed ex- 
perimentally (Figure S4B). 

Since tRNA surveillance through CCACCA addition is con- 
served across all kingdoms of life (Wilusz et al., 2011), we also 
wanted to include class II CCA-adding enzymes. Given a lack 
of sufficient structural information on class II enzymes, the struc- 
ture of the full-length human mitochondrial CCA-adding enzyme 
was determined to 1.9 Å (Figure S4C). Although the tail domain
was disordered in Augustin et al. (2003), it is clearly resolved 
in our structure, and we were able to construct a model of a 
tRNA-bound complex (Figure 4C). Based on this model and 
allowing for conformational changes in the protein, several 
cysteine mutants were tested for their ability to form disulfide 
bonds with the crosslinkable tRNA. Two mutants, G364C 
and G379C, crosslinked efficiently to both stable and unstable 
tRNA minihelices (Figure 4D). 

We next purified these crosslinked complexes to test whether 
RNA refolding occurs on the enzyme in a standard CCA addition 
assay (Shi et al., 1998b). Reactions were performed in the presence
of 2 μM cold ATP and CTP, combined with either radioactive
ATP or CTP to measure A or C incorporation, respectively.
Since the minihelices used ended in -CC (miniUR-CC), further 
C incorporation served as a sensitive readout of refolding, as 
it should only occur during the second CCA cycle, after the 
substrate has successfully completed the first cycle. A stable 
tRNA minihelix (miniR-CC) was used as a control as it should 
terminate synthesis after the first CCA cycle and thus not incor- 
porate CTP. 

When we measured radioactive ATP incorporation onto the 3'- 
end of stable and unstable human complexes, we detected a 
lower molecular weight product for the stable tRNA complex 
and a higher molecular weight product for the unstable tRNA 
complex, as would be expected for CCA versus CCACCA addi- 
tion (Figure 4E, lanes 1 and 2). Surprisingly, similar results were 



Figure 2. Clockwise Screw Motion and RNA Compression upon Active Site Closure 

Structural transitions upon active site closure. 

(A) RNA and protein domains of the open complex (miniUR-CCAC) are shown in cartoon representation in gray. The preinsertion complex (miniUR-CCAC+CTP) is 
colored as in Figure 1A with miniUR-CCAC+CTP RNA, including the incoming CTP, in orange. Catalytic magnesium ions are in magenta. The clockwise screw
motion of the head domain and RNA is indicated. 

(B) Close-up view of the acceptor stem of miniUR-CCAC (gray) and miniUR-CCAC+CTP (orange). The A70 mutation is boxed in red. RNA compression is 
indicated by an arrow. 

(C) Top-down view of (B) including the miniUR-CCAC+CTP β-hairpin (purple). Clockwise RNA rotation is indicated.

(D) Location of the RNA bulge in all three preinsertion complex structures of the second CCA cycle. Common nucleotides to all structures are numbered in black, 
unique ones follow the RNA color scheme. Loops 1 and 2 are shown for miniUR-CCAC+CTP only. 

(E) In vitro CCA-addition assays were performed to assay the ability of AfCCA containing Loop 1 insertions to incorporate [α-32P]ATP onto stable or unstable
(G70A) arginyl-tRNA TCG minihelices ending in -C. The insertion of three or six glutamates abolished the ability of the CCA-adding enzyme to mark unstable RNAs 
for degradation by adding CCACCA. 

See also Figure S2 and Movies S1 and S2.

Figure 3. Unstable RNA Is Held onto the CCA-Adding Enzyme through Ionic “Tweezers”

(A) Structure of the adenosine preinsertion complex of the Menβ tRNA-like small RNA (miniMβ-CCACC+AMPcPP), as in Figure 1A. The disordered RNA bulge is
dashed and colored in pink. 

obtained using radioactive CTP (Figure 4E, lanes 3 and 4), despite
our expectation not to detect CTP incorporation into stable 
tRNAs ending in -CC. Upon reversing the crosslink by adding 
20 mM DTT prior to performing the CCA addition assay, CTP 
incorporation was detected only onto the unstable tRNA, indi- 
cating that the stable tRNA now completed only the first CCA cy- 
cle (Figure 4E, lanes 7 and 8). Interestingly, all RNA products were 
smaller under reducing (Figure 4E, lanes 5-8) versus oxidizing 
conditions (Figure 4E, lanes 1-4), with the products under
reducing conditions running at the same sizes as never-cross- 
linked reaction controls (Figure 4E, lanes 9 and 10). We therefore 
conclude that crosslinking the human CCA-adding enzyme to its 
RNA substrate resulted in tandem CCA addition to stable RNAs 
and three CCA triplets being added to unstable RNAs. Similar 
results were obtained with the crosslinked archaeal enzyme, 
although the difference between stable and unstable RNA was 
less pronounced (Figure S4D). We should note that upon crosslinking
the substrate to either enzyme and increasing the nucleotide
concentration to 50 μM, both enzyme classes behaved like
“oligo-CCA”-adding enzymes and added extended CCA tails 
to both stable and unstable RNAs, with unstable RNAs harboring 
longer tails on average (Figure S4E). This “oligo-CCA” addition is 
not observed in vivo (Wilusz et al., 2011) since refolding and CCA
addition rates are much higher for crosslinked complexes 
compared to those of the uncrosslinked versions. 

For the competition experiment, we reasoned that if the RNA 
refolded between the two CCA-addition cycles while bound to 
the enzyme, a pre-bound tRNA would not be competed off the 
enzyme by another RNA when transitioning from the first to the 
second CCA cycle. We, therefore, pre-bound unstable full- 
length arginyl-tRNA TCG (G70A) ending in its discriminator base 
to either the human or the archaeal CCA-adding enzyme in the 
presence of CTP only. After having incorporated two C nucleo- 
tides, we expected the enzymes to remain bound to their sub- 
strate RNAs, awaiting ATP to complete the first CCA cycle. 
ATP was then added simultaneously with an excess of full-length 
human Menβ tRNA-like small RNA already ending in CCA, and
product formation was monitored. Consistent with the tRNA refolding
on enzyme, pre-binding of unstable arginyl-tRNA TCG to
both classes of enzymes rendered the enzymes “immune” toward
excess RNA and resulted in similar amounts of mature
arginyl-tRNA TCG product, irrespective of the competing Menβ
concentration (Figure 4F, lanes 1-4). In contrast, when both
arginyl-tRNA TCG and Menβ substrates ending in -CC were added
at the same time, final products were predominantly synthesized
for Menβ, the substrate present in excess (Figure 4F, lanes 5-8).
We note that human Menβ is a better substrate for the human
CCA-adding enzyme compared to the archaeal enzyme (Figure 4F).
Also, when the archaeal reaction scheme was inverted
such that human Menβ was pre-bound and arginyl-tRNA TCG
served as the competitor, we observed a less pronounced trend,
since Menβ is not as good a substrate as arginyl-tRNA TCG
(Figure S4F).

Taken together, it is clear that tandem CCA addition does not 
involve RNA dissociation. Instead, RNA refolding takes place 
“on-enzyme.” 

Active Site Closure Gauges RNA Stability and Triggers 
Refolding 

What triggers “on-enzyme” RNA refolding to enable the second 
CCA cycle? Termination of the first CCA cycle, which precedes 
refolding, has previously proven difficult to study. It is, however, 
known that in the termination state, the entire CCA triplet stacks 
onto the acceptor stem and abuts against the β sheets containing
the active site residues, thereby blocking the addition of
further nucleotides (Xiong and Steitz, 2004). 

Insight regarding how refolding is triggered was gained from a 
2.7 Å structure of a monoclinic crystal of the Menβ tRNA-like
small RNA adenosine preinsertion complex (miniMβ-CCACC+AMPcPP-i).
Most of the RNA bound in the active site pocket of
this complex is ordered, despite the pocket not being entirely
closed (Figures 5A and S5A). As expected for a substrate ending
in CCACC, the tRNA-like transcript is bound in its refolded state,
ready for the second CCA cycle to be completed with the addition
of an A. Nucleotides C59 and C60 (equivalent to C74 and C75 for
tRNAs) form strong base pairs to G1 and G2. A61 (A76 for tRNAs),
the last A of the first CCA triplet, is in a similar position to a
discriminator base. However, its base edge projects toward a
β-hairpin that has been previously shown to be critical for catalysis
and proofreading (Cho et al., 2005). This A also stacks with
His97, thereby displacing Asp96 and Ala95. C62 (C77 for tRNAs),
the first C of the second cycle, packs against Tyr99, whereas C63
(C78) folds back onto the 3'-end of the acceptor stem, lying
orthogonal to it and tucking against Arg129 (Figures 5A and
S5A). Although electron density for the C63 base is weak, its
3'-hydroxyl group is clearly not in a position to attack the α-phosphate
of the incoming nucleotide. Interestingly, the incoming
AMPcPP is also not properly locked into its binding pocket as 
its base edge is not recognized by Arg224, nor do the three phos- 
phate groups coordinate metal ion A. 

When we compared this structure to the proper adenosine 
preinsertion complexes of Menβ (miniMβ-CCACC+AMPcPP,
Figure 3A) and unstable tRNA (miniUR-CCACC, Figure S2B), 
we realized that the observed state represents a functional inter- 
mediate between the open (inactive) and the closed preinsertion 
complexes. The bound small RNA exhibits an intermediate 



(B) Comparison between the miniMβ-CCACC+AMPcPP and miniUR-CCACC+AMPcPP RNA structures. Both RNAs are schematically diagrammed on the right
with disordered nucleotides in gray. The kink in miniMβ-CCACC RNA is sketched.

(C) Temperature factor distribution of miniMβ-CCACC+AMPcPP. Low temperature factors of 20 Å² are in blue, intermediate values in yellow, and values above
80 Å² in red.

(D) Electrostatic surface potential of the archaeal CCA-adding enzyme. Blue depicts positively charged, white neutral, and red negatively charged areas. The two 
positively charged patches holding the acceptor stem are indicated. MiniMβ RNA is in cartoon and colored in pink. The close-up view details the ionic interactions
that glue the top of the acceptor stem to the enzyme. A schematic of the “ionic tweezers” is shown. 

(E) In vitro CCA-addition assays using wild-type or mutant AfCCA and the unstable (G70A) arginyl-tRNA TCG minihelix (miniUR) substrate. Weakening the strong 
interactions between the RNA 5'-end and the enzyme impaired the second CCA-addition cycle, whereas the first cycle was unaltered. 

See also Figure S3.

Figure 4. On-Enzyme RNA Refolding Permits the Transition from the First to the Second CCA Cycle

(A) The two interchangeably bound RNA conformations observed in bromouridine (BrU)-labeled miniUR-CCACCA co-crystals. BrU and RNA instabilities are
highlighted in red.

(B) The double-helical part of the non-refolded RNA from (A) is shown in cartoon representation and colored in gray. BrU are in red. The anomalous difference
Fourier map is colored in green and contoured at 3.5σ.

(C) Structure of the full-length human mitochondrial CCA-adding enzyme. Protein domains are colored according to the archaeal enzyme in Figure 1A. The newly
resolved tail domain is highlighted. A tRNA minihelix is modeled according to PDB code 1VFG (Tomita et al., 2004). An ethylthio-purine crosslinker was
incorporated at tRNA position C56 and crosslinked to the CCA-adding enzyme carrying a G379C mutation.



degree of compression and the β-hairpin displays an intermediate
degree of closure (Figure S5B). Similar to our observations
for unstable tRNA (Figure 2), the transition from the intermediate 
to the preinsertion complex revealed a coordinated clockwise 
screw motion of the head domain, resulting in RNA overwinding 
and compression of the acceptor stem by an entire base pair 
step (Figure 5B; Movies S3 and S4). The head domain pushed 
the protruding nucleotide A61 of the intermediate state back 
into its proper position to stack onto the 3'-end of the acceptor 
helix, unstacking it from His97. The β-hairpin is now in a proper
conformation to proofread C62, which undergoes a 90° rotation,
whereas C63 reorients its 3'-hydroxyl toward Asp110, which
serves as a general base during adenosine incorporation (Pan
et al., 2010). The incoming AMPcPP moves to its proper preinsertion
position, interacting with Arg224 and coordinating Metal B
along with the catalytic residues Glu59 and Asp61. Interestingly,
only the Menβ preinsertion complex shows a kinked helix, likely
induced by complete active site closure (Figures 3A and S5C). 

The apparent correlation between active site closure and RNA 
compression indicates that apart from being essential for catal- 
ysis, active site closure is the way by which the stability of the 
RNA is assessed. Following the first CCA addition, the torque 
applied on the RNA double helix essentially gauges weak spots 
in the duplex and causes refolding of the RNA in the form of 
extrusion of a bulge. 

Nucleotide Binding Is Critical for Triggering RNA 
Substrate Refolding 

Three lines of evidence suggested that nucleotide binding is crit- 
ical to trigger refolding. First, our ability to isolate an intermediate 
complex is likely attributed to the presence of a sub-optimally 
positioned nucleotide in the active site pocket. Proper binding of 
the triphosphate moiety locks the enzyme in the catalytically 
active preinsertion state. Second, the termination state after the 
first CCA cycle (PDB 1SZ1) (Xiong and Steitz, 2004) has sufficient
space for an incoming nucleotide, which could start off the second 
CCA-addition cycle (Figure 5C). Finally, since active site closure is 
accompanied by a clockwise rotation, we reasoned that the termi- 
nation complex is formed from the adenosine preinsertion com- 
plex by a counter-clockwise motion of the head domain, which 
appears to be the case (Figure 5C). This movement is essential 
to reset the enzyme prior to a new cycle of CCA addition. 

As nucleotide binding is critical to trigger refolding, we hypoth- 
esized that increasing the nucleotide concentrations in standard 
CCA-addition assays (Shi et al., 1998b) may lower the threshold 
of instability needed in a tRNA acceptor stem for CCACCA addition 
to occur. Using a nucleotide concentration of 0.5 μM, we 
recapitulated what is seen in vivo for full-length wild-type or 
mutant arginyl tRNA (Figure 5D). However, at nucleotide concentrations 
above 5 μM, the wild-type tRNA was converted into a 
tandem CCA target by both classes of CCA-adding enzymes 
(Figure 5D). Similar results were obtained for minihelix substrates 
(Figure S5D). At nucleotide concentrations above 20 μM, we detected 
even further extension of the CCA tails, reminiscent of the 
results obtained with crosslinked complexes (Figure 4E). Most 
surprisingly, both enzyme classes converted stable tRNAs into 
CCACCA targets at identical nucleotide concentrations, which 
was unexpected since tRNAs approach their respective active 
sites from entirely different directions (Xiong et al., 2003). Where- 
as the extruded RNA bulge tucks against the protein for class I 
enzymes (Figure 2), it almost certainly protrudes into the solvent 
in class II enzymes (Figure S5E). 

Additional tRNAs were subsequently tested to determine 
whether sequence elements within the tRNA may affect the abil- 
ity of nucleotide concentration to regulate CCA versus CCACCA 
choice. As in our model arginyl-tRNA TCG substrate, wild-type 
cysteinyl-tRNA GCA contains a wobble base pair in its acceptor 
stem and was converted into a tandem CCA target at a nucleotide 
concentration of 5 μM (Figure S5F). On the other hand, wild-type 
tRNAs with acceptor stems containing only Watson-Crick 
base pairs required nucleotide concentrations to be at least 
ten times higher (50 μM) before they were converted into 
CCACCA targets (Figures S5G and S5H). 
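
The concentration dependence reported above can be summarized as a simple threshold rule. The following is a minimal sketch, assuming the approximate values quoted in the text (about 5 μM for wild-type tRNAs with a wobble pair in the acceptor stem, about 50 μM for stems with only Watson-Crick pairs); the function and constant names are hypothetical and are not part of the original study.

```python
# Illustrative sketch only. Thresholds are the approximate in vitro values
# quoted in the text; names are hypothetical, not from the original study.
WOBBLE_THRESHOLD_UM = 5.0          # acceptor stem contains a wobble base pair
WATSON_CRICK_THRESHOLD_UM = 50.0   # acceptor stem has only Watson-Crick pairs


def predicted_tail(has_wobble_pair: bool, nucleotide_um: float) -> str:
    """Return the expected 3' tail for a wild-type tRNA in the in vitro assay."""
    if has_wobble_pair:
        threshold = WOBBLE_THRESHOLD_UM
    else:
        threshold = WATSON_CRICK_THRESHOLD_UM
    # At or above the threshold, the text reports conversion to a tandem CCA target.
    return "CCACCA" if nucleotide_um >= threshold else "CCA"


if __name__ == "__main__":
    for wobble in (True, False):
        for conc in (0.5, 5.0, 50.0):
            print(wobble, conc, predicted_tail(wobble, conc))
```

The rule applies only to wild-type substrates; destabilized mutants such as G70A are reported to receive CCACCA already at 0.5 μM.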

We suggest that the choice between CCA versus CCACCA 
addition is made when a nucleotide binds to the active site in 
the termination state following the first CCA addition cycle. For 
stable substrates, clockwise active site closure triggered by 
nucleotide binding leads to RNA dissociation as the energetic 
cost of breaking multiple Watson-Crick base pairs is too high. 
Unstable RNAs, however, will refold “on-enzyme” and a second 
CCA triplet will be added. Elevated nucleotide concentrations 
shift the reaction equilibrium toward tandem CCA addition with 
the actual concentration wherein this switch occurs dependent 
on acceptor stem stability. Elevated nucleotide concentrations 
might favor a closed active site, which would trap even tran- 
siently refolded stable substrates after the first CCA cycle on 
the enzyme by adding the first C of the second cycle. 

RNAs with Longer Bulges Are Not Readily 
Accommodated by the CCA-Adding Enzyme 

Why, however, is CCA addition limited to a tandem triplet in vivo? 
Although extended tails longer than CCACCA are efficiently 
added under certain conditions in vitro (higher nucleotide concen- 
trations or when the RNA is covalently bound to the enzyme), they 



(D) Silver-stained SDS-PAGE of the crosslinked human CCA-adding enzyme and the ethylthio-purine-derivatized minihelices. Crosslinked complexes show 
retarded gel mobility, with the G379C and G364C mutants showing the highest crosslinking efficiency. 

(E) In vitro CCA addition assays with crosslinked complexes. The wild-type and G70A mutant arginyl-tRNA TCG minihelices are denoted as stable and unstable, 
respectively. Reactions 2-5 were under oxidizing, whereas reactions 6-10 were under reducing conditions. Radioactive nucleotides are indicated above the 
lanes. Unstable minihelices that had never been crosslinked were used as controls (reactions 9-10). 

(F) In vitro competition experiments. In lanes 1-4, full-length arginyl-tRNA TCG (G70A) was pre-bound to the human and archaeal CCA-adding enzyme in the 
presence of 2 μM CTP before adding cold ATP supplemented with [α-32P]-ATP and varying amounts of the hMENβ transcript (up to a 20-fold excess). In lanes 5-8, 
both RNAs were added simultaneously along with ATP. An equal amount of arginyl-tRNA TCG was added in all lanes, whereas hMENβ amounts varied as denoted 
at the top. Pre-bound RNA was efficiently extended to CCACCA irrespective of added excess RNA, indicating that the bound RNA refolds “on-enzyme.” When 
added simultaneously, both RNAs compete for binding to the CCA-adding enzyme. 

See also Figure S4. 
Figure 5. Active Site Closure Gauges RNA Stability and Triggers Refolding 

(A) The intermediate adenosine preinsertion complex of Menβ (miniMβ-CCACC+AMPcPP-i) shown as in Figure 1A. Residues contributed by the head domain are 
in atom colors with carbons in purple. The disordered residues Ala95 and Asp96 are dashed in purple. Arg224 from the neck domain is in atom colors with carbons 
in green. A dashed circle highlights the missing Metal B. Improper positioning of the incoming nucleotide is shown with black dashes. 

(B) Superposition of the adenosine intermediate and preinsertion complexes (miniMβ-CCACC+AMPcPP). The intermediate state RNA and AMPcPP are in gray, 
while the intermediate β-hairpin and Asp110 are in purple. Intermediate Arg224 is in green. The entire preinsertion complex (miniMβ-CCACC+AMPcPP) is in pink. 
Three proofreading interactions, the AMPcPP base edge recognition and the Asp110 general base contact to the 3'-hydroxyl are shown with green dashed lines 
for the preinsertion complex, as is Metal B bound to AMPcPP. 

(C) Comparison of the miniMβ-CCACC+AMPcPP complex and the termination complex of the first CCA cycle (PDB code 1SZ1) (Xiong and Steitz, 2004). The 
incoming AMPcPP is depicted for Menβ. Canonical tRNA numbering is used for Menβ to simplify comparison. The active site residues that the RNA stacks against 

Figure 6. Model of RNA Surveillance by the CCA-Adding Enzyme 

The head and neck domains of the CCA-adding enzyme are shown as abstract bodies colored in purple and green, respectively, with the β-hairpin as an oval. 
Bound unstable RNA is shown in orange with its compression state changing during the CCA-addition cycles. Steps 1-10 are described in the text. 

See also Figure S6. 



are only rarely added in vivo (Wilusz et al., 2011, 2012). To understand 
why, we carried out fragment-based assembly of RNA with 
full atom refinement (FARFAR), a ROSETTA-based de novo RNA 
prediction algorithm (Das and Baker, 2007; Das et al., 2010). Us- 
ing the miniUR-CCAC+CTP complex as a template (Figure 2), 
5,000 structures were calculated assuming either a 3nt (for tan- 
dem CCA) or a 6nt bulge (for three CCAs). As expected, models 
containing a 6nt bulge deviated more significantly from the tem- 
plate structure and none formed a continuous helix (Figure S6A). 
In contrast, about a third of the models with a 3nt bulge exhibited a 
continuous helix (Figure S6B). 

We posit that in order to apply torque onto the acceptor stem 
to trigger refolding, a continuous helix would be necessary. From 
these calculations, it appears that the reason that three CCAs are 
not accommodated is because the resulting RNA molecules are 
unable to form continuous helices. 

DISCUSSION 

Model of Tandem CCA Addition 

tRNA surveillance through tandem CCA addition is a universally 
conserved mechanism by which the cell distinguishes between 
stable and certain unstable tRNAs and tRNA-like transcripts (Wilusz 
et al., 2011). While stable tRNAs receive only CCA, unstable 
tRNAs beginning with GG are marked with CCACCA with near 
100% efficiency and rapidly degraded (Wilusz et al., 2011). We 
show that the structural flexibility of RNA controls the choice be- 
tween CCA versus CCACCA addition and propose the following 
model for tandem CCA addition (Figure 6): (1-3) a proper tRNA- 
type substrate is first recognized by the CCA-adding enzyme. 
The discriminator base is inserted between the head and neck 
domains, while the TψC loop interacts with the enzyme’s tail 
domain (Figure 1A). The first CCA addition cycle then proceeds 
as has been established (Pan et al., 2010; Tomita et al., 2006; 
Xiong and Steitz, 2004). Following the addition of the terminal 
A, pyrophosphate is released and the head domain rotates 
counter-clockwise to allow continuous stacking of the CCA triplet 
against the head domain (Figure 5C). (4) Nucleotide binding at 
this stage induces the clockwise closure of the active site with 
torque applied on the RNA duplex (Figures 2 and 5). This “inter- 
rogation” of the RNA by the enzyme results in two possible out- 
comes: (5) if the bound RNA is stable, it will dissociate from the 
enzyme since breaking multiple stable Watson-Crick base pairs 
for RNA refolding is unfavorable; (6) however, if the RNA is 



in the termination complex are shown as sticks. The counter-clockwise twist between the Menβ preinsertion complex and the tRNA termination complex is 
indicated. 

(D) In vitro CCA addition assays with full-length stable and unstable (G70A) arginyl-tRNA TCG and either the human or the archaeal CCA-adding enzyme. Cold 
nucleotide concentrations (ATP+CTP) were increased in the presence of radioactive CTP. CCA tail length is indicated. 

See also Figure S5 and Movies S3 and S4. 
unstable, the screw motion will trigger on-enzyme refolding of the 
RNA 3'-end, in the form of extrusion of a bulge, while the 5'-end 
and the TψC loop remain bound to the enzyme (Figure 4). Refolding 
thus leads to a structure that is reminiscent of a misaligned 
zipper with the bulged nucleotides protruding from an otherwise 
continuous RNA helix (Figures 2 and 3). This bulged substrate is 
stabilized by two new base pairs that form between the Cs added 
during the first CCA cycle and the 5'-terminal Gs and further stabilized 
by ionic interactions to the enzyme (Figure 3) as well as interactions 
between the bulge and loop 1 of the head domain. 
These interactions help lower the energy for the refolded confor- 
mation. (7-8) Once the active site is closed over the refolded 
RNA, the second cycle of CCA addition proceeds analogous to 
the first. Upon termination, the second CCA triplet again stacks 
against the head domain following a counter-clockwise rotation. 
(9) Nucleotide binding could then, in principle, result in yet 
another round of RNA refolding (10). However, RNA refolding at 
this stage would result in a 6nt bulge, a structure that is unable 
to accommodate a continuous helix, which is required for the 
RNA to remain bound to the enzyme (Figure S6). Therefore, 
the screw motion for active site closure and torque applied on 
the RNA following the second CCA-addition cycle induces disso- 
ciation of the RNA carrying CCACCA at its 3' end. 
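
The stepwise model described above can be read as a small decision loop. The following is a minimal sketch, assuming that each refolding event extrudes a 3-nt bulge and that a bulge is tolerated only up to a fixed size; the function name and the `max_helix_bulge_nt` parameter are hypothetical illustrations and do not represent a simulation performed in the study.

```python
def cca_surveillance(stem_is_stable: bool, max_helix_bulge_nt: int = 3) -> str:
    """Walk through the proposed model and return the final 3' tail.

    stem_is_stable: True if the acceptor stem resists refolding under torque.
    max_helix_bulge_nt: largest bulge (in nt) assumed to be compatible with a
        continuous helix; 3 reflects the modeling result that a 6-nt bulge
        is not accommodated (an assumption of this sketch).
    """
    tail = "CCA"      # steps 1-3: recognition and the first CCA cycle
    bulge_nt = 0
    while True:
        # Steps 4 and 9: nucleotide binding closes the active site and
        # applies torque to the RNA duplex.
        next_bulge = bulge_nt + 3
        can_refold = (not stem_is_stable) and next_bulge <= max_helix_bulge_nt
        if not can_refold:
            # Steps 5 and 10: the RNA dissociates with its current tail.
            return tail
        # Step 6: on-enzyme refolding extrudes a bulge.
        bulge_nt = next_bulge
        # Steps 7-8: another CCA cycle proceeds on the refolded RNA.
        tail += "CCA"


if __name__ == "__main__":
    print(cca_surveillance(stem_is_stable=True))
    print(cca_surveillance(stem_is_stable=False))
```

Raising `max_helix_bulge_nt` to 6 in this sketch yields a third triplet, which mirrors the longer tails seen in vitro under conditions where the RNA cannot dissociate.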

Proofreading Unstable RNAs through Their Structural 
Versatility 

Proofreading is an important feature of all polymerases. As DNA 
and RNA polymerases scan along a template strand while syn- 
thesizing a product strand, they recognize misincorporated 
nucleotides or erroneous templates either before or immediately 
after nucleotide incorporation (Kunkel and Bebenek, 2000; 
Sydow and Cramer, 2009). These transcriptional obstacles are 
usually removed through endonucleolytic cleavage before 
nucleotide synthesis resumes. 

In contrast, the CCA-adding enzyme uses proofreading on two 
different levels. First, it is able to add the nucleotides CCA in a 
template-independent manner to the 3' ends of all tRNAs and 
tRNA-like molecules without translocating along its substrate 
(Pan et al., 2010; Toh et al., 2008; Tomita et al., 2006; Xiong 
and Steitz, 2004). Second, the enzyme exploits the structural 
versatility of unstable RNAs to mark them for degradation, 
even though the mutation or instability is never directly recog- 
nized by the active site of the enzyme. The enzyme simply sticks 
to its substrate requirements and catalytic principles regardless 
of the substrate. If a continuous A-form RNA helix of proper 
length is bound between the head and tail domains, and an un- 
paired discriminator base can be properly positioned in the 
active site, then the CCA-adding enzyme will add a CCA triplet 
to the RNA 3'-end. Rather than dissociating after the first CCA 
cycle, unstable RNAs are refolded in response to the regular 
screw motion of the enzyme during active site closure. CCA- 
addition terminates once the RNA substrate is unable to remain 
bound to the enzyme. We speculate that tandem CCA addition is 
the unavoidable consequence of CCA addition to a large subset 
of unstable RNAs beginning with GG, since refolding is triggered 
during catalysis. As it is driven by the RNA substrate itself and is 
beneficial for removing erroneous tRNAs, this mechanism is 
conserved across both classes of CCA-adding enzymes. 



We suggest that RNA compression after completion of the first 
CCA cycle, which challenges RNA stability and triggers refolding 
of unstable RNAs, is an example of proof-reading via an energy 
relay mechanism, first proposed by Hopfield (1980). RNA com- 
pression as seen in the dynamic intermediate state of the 
enzyme is at a branch point, characteristic of these types of 
mechanisms. The RNA substrate at this branch point either “re- 
sists” and falls off the enzyme when it contains a stable acceptor 
stem, or “buckles,” extruding a bulge, and then receives another 
CCA when the acceptor stem is weakened due to a mismatch. 
The compression of the RNA provides the energy relay for proof- 
reading in this case. 

Our study elucidates how non-coding RNAs (ncRNAs) that 
utilize the tRNA fold are either marked for rapid degradation 
or stabilized by the addition of a single CCA triplet. Since 
bona fide tRNAs possess very long half lives (Kanerva and 
Maenpaa, 1981), we speculate that CCA-addition is a way 
for ncRNAs to increase their stability. Interestingly, while the 
Menβ tRNA-like small RNA is rapidly degraded due to CCACCA 
addition in humans and mice, the Old World Monkey homolog is 
stable and thus a CCA target (Wilusz et al., 2011). In the future, 
it will be instructive to study the functional significance of 
these differences. Furthermore, the surveillance mechanism 
presented here is conserved across all kingdoms of life and 
across different classes of RNA. This underlines the centrality 
of RNA structure in regulating transcript stability and function 
(Ding et al., 2014; Rouskin et al., 2014; Wan et al., 2014) and 
calls for a more detailed search into the structural elements of 
lncRNAs. 

EXPERIMENTAL PROCEDURES 

Structures of the Archaeal CCA-Adding Enzyme in Complex with 
Unstable RNAs 

The A. fulgidus CCA-adding enzyme (AfCCA) was expressed in Escherichia 
coli and all RNAs used for crystallography were purchased from Dharmacon. 
Preinsertion complexes were obtained by co-crystallization with AMPcPP or 
CTP. Most reaction steps crystallized in 16%-24% PEG-3350 and 0.2- 
0.27 M sodium-potassium tartrate. Diffraction data were collected at beamline 
X25 of the National Synchrotron Light Source (NSLS) at Brookhaven National 
Laboratory. All structures were solved by molecular replacement using 
PHASER (McCoy et al., 2007) and refined with PHENIX (Adams et al., 2010). 
For details, see Extended Experimental Procedures. 

Structure of the Full-Length Human CCA-Adding Enzyme 

The human mitochondrial CCA-adding enzyme was expressed in E. coli and 
crystallized in 2.3 M ammonium sulfate and 0.5 M tri-sodium citrate pH 5.8. 
Its structure was solved by molecular replacement and refined with PHENIX 
(Adams et al., 2010). For details, see Extended Experimental Procedures. 

Crosslinking Experiments, CCA-Addition Assays 

A 2-amino-(S-trityl-ethylthio)-purine phosphoramidite was synthesized as 
described (Peacock et al., 2011). Crosslinkable RNA was prepared at Chem- 
Genes. All in vitro assays were carried out with crosslinked and soluble com- 
ponents as described (Shi et al., 1998b). For details, see Extended Experi- 
mental Procedures. 

Competition Experiments 

To study the effect of RNA pre-binding, full-length arginyl tRNA TCG (G70A) 
ending in its discriminator base was pre-incubated in the presence of cold 
CTP only. Cold ATP spiked with [α-32P] ATP was added simultaneously with 
equimolar or excess amounts of full-length human Menβ already carrying 
CCA. Reactions were allowed to proceed for 2 min for AfCCA and 30 s for the 
human CCA-adding enzyme. As a control experiment, both RNAs were added 
simultaneously along with ATP. For details, see Extended Experimental 
Procedures. 

656 Cell 160, 644-658, February 12, 2015 ©2015 Elsevier Inc. 

Rosetta-Based RNA Modeling 

De novo RNA folding was carried out using Rosetta 3.5 
(http://www.rosettacommons.org) (Das et al., 2010). For the detailed 
procedure, see Extended Experimental Procedures. 

ACCESSION NUMBERS 

Coordinates and structure factors of the archaeal CCA-adding enzyme com- 
plexes have been deposited in the Protein Data Bank (PDB) under accession 
codes 4X4N, 4X4O, 4X4P, 4X4Q, 4X4R, 4X4S, 4X4T, 4X4U, and 4X4V. The 
full-length human mitochondrial CCA-adding enzyme has been deposited un- 
der the accession code 4X4W. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, six 
figures, four movies, and one table and can be found with this article online 
at http://dx.doi.org/10.1016/j.cell.2015.01.005. 
Highlights 

• Physical confinement and low adhesion induce the 
mesenchymal-amoeboid transition 

• A large range of slow mesenchymal cell types can display 
fast amoeboid-like migration 

• A fast mode (A1) and a faster and more conserved contractile 
mode (A2) were observed 

• A2 migration could be an ancestral migratory behavior 
shared among eukaryotes 
SUMMARY 

The mesenchymal-amoeboid transition (MAT) was 
proposed as a mechanism for cancer cells to adapt 
their migration mode to their environment. While 
the molecular pathways involved in this transition 
are well documented, the role of the microenviron- 
ment in the MAT is still poorly understood. Here, we 
investigated how confinement and adhesion affect 
this transition. We report that, in the absence of focal 
adhesions and under conditions of confinement, 
mesenchymal cells can spontaneously switch to a 
fast amoeboid migration phenotype. We identified 
two main types of fast migration — one involving a 
local protrusion and a second involving a myosin-II-dependent 
mechanical instability of the cell cortex 
that leads to a global cortical flow. Interestingly, 
transformed cells are more prone to adopt this fast 
migration mode. Finally, we propose a generic model 
that explains migration transitions and predicts a 
phase diagram of migration phenotypes based on 
three main control parameters: confinement, adhe- 
sion, and contractility. 

INTRODUCTION 

The mechanisms individual cells use to migrate can be divided 
into two distinct types, known as mesenchymal migration and 
amoeboid migration. Cells migrating in a mesenchymal fashion 
typically adopt an elongated, spindle-like shape and exert traction 
on their substrates via focal adhesions associated with actin rich 
protrusions, such as lamellipodia or filopodia. In contrast, cells 
undergoing amoeboid migration adopt round or irregular shapes. 
They undergo cycles of expansion and contraction, which allow 
them to squeeze through gaps in the extracellular matrix. The ve- 
locity of amoeboid migration is often higher than that of mesen- 
chymal migration (Pankova et al., 2010) and it appears to involve 
a range of different sub-modes, such as bleb-based migration 
or gliding (Paluch and Raz, 2013). Consequently, it is not as 
well-defined at the molecular level as mesenchymal migration, 
although it is thought to rely on an increased activity of the RhoA 
pathway, which activates formin-based actin nucleation and 
myosin II contractility (Sanz-Moreno and Marshall, 2010). 

Despite an extensive literature on the subject, it remains unclear 
whether the various modes of amoeboid migration are mechanis- 
tically distinct from each other, or whether they represent a contin- 
uum of a single process. Recent work, however, has identified 
mechanisms common to several forms of amoeboid migration, 
supporting the continuum model. Several studies report, for 
example, that amoeboid migration is often based on a contractile 
cell rear inducing a retrograde flow in the cell cortex (Heuze et al., 
2013; Paluch and Raz, 2013; Poincloux et al., 2011). Others have 
shown that, in contrast with mesenchymal migration, amoeboid 
migration does not depend on a cell’s ability to form focal adhe- 
sions with its environment (Bergert et al., 2012; Lammermann 
et al., 2008; Renkawitz et al., 2009). In the light of these studies, 
Lammermann and Sixt (2009) have proposed that three key pa- 
rameters modulate the mode of migration of cells, namely protru- 
sion, contractility, and adhesion. They further suggest that by 
altering the balance between these parameters, cells could adapt 
their mode of motility to different migration environments (Lam- 
mermann and Sixt, 2009), allowing migrating cells to negotiate 
the topologies of the different tissues they encounter. 

Migration mode plasticity is an important factor not only in immune 
cell migration but also in cancer metastasis, as disseminating 
tumor cells have to navigate a range of extracellular matrix ge- 
ometries to escape their point of origin and spread to distant 
sites. Metastasizing cancer cells exhibit both mesenchymal 
and amoeboid modes of migration, and recent studies have 
shown that they can switch between the two, a phenomenon 
known as the mesenchymal-amoeboid transition (MAT) (Friedl 
and Wolf, 2003; Pankova et al., 2010; Taddei et al., 2013; To- 
zluoglu et al., 2013). Importantly, non-migrating cells such as 
epithelial cells can first undergo a transition to mesenchymal 
migration (epithelial to mesenchymal transition or EMT), enabling 
them to develop collective and/or individual migratory behaviors 
associated with cancer cell dissemination (Friedl and Wolf, 2003; 
Taddei et al., 2013). This suggests that, by going through 
sequential transitions, even epithelial cells could eventually 
display amoeboid motility (Giampieri et al., 2009). 
Given its importance in metastasis, the MAT has been the focus 
of several recent studies. These have identified key parameters 
driving the transition: cell-intrinsic factors, such as intracellular 
signaling and contractility, and extrinsic, environmental factors, 
such as cell adhesion and the geometry of the extracellular ma- 
trix. Using a protease inhibitor mix to inhibit the matrix remodeling 
ability of mesenchymal cells induces the MAT and is associated 
with a reduction in integrin expression (Carragher et al., 2006; 
Wolf et al., 2003). Tozluoglu et al. (2013) combined in vivo experiments 
and mathematical simulations to highlight cell confinement 
as a crucial parameter affecting the efficiency of various 
migration modes. In vitro studies using micro-channels also point 
to an important contribution of confinement in changing the 
requirements for efficient locomotion (Hung et al., 2013). 

Until now, however, no systematic quantitative study has been 
performed to understand the interplay between a cell’s environ- 
ment and its migration behavior. Here, we describe how we sys- 
tematically studied the effects of confinement and adhesion on 
migration modes. Using a variety of cell types, we found that 
slow mesenchymal cells can switch to fast amoeboid-like migra- 
tion under conditions of low adhesion and high confinement. We 
propose a simple generic physical model that produces a phase 
diagram of migration modes and depends on both the intrinsic 
properties of cells and environmental parameters. 

RESULTS 

Adhesion Decreases Speed but Increases Persistence 
in Non-Confined Conditions 

To systematically investigate the effects of adhesion and 
confinement on cell migration, we combined two methods (Figure 1): 
one to control cell adhesion (by quantitatively controlling the 
density of RGD peptides on the cell substrate) (Barnhart et al., 2011) 
and one to confine cells plated on a 2D substrate 
(Figure S1; Le Berre et al., 2012). 

Figure 1. Schematic Diagram of Cell Migration Assay 

Cell culture substrates were treated with various 
ratios of pLL-PEG and pLL-PEG-RGD to control 
adhesion of cells with their environment and 
PEG-treated slides containing micro-spacers 
were used to confine cells at a fixed height. 

See also Figure S1. 

Normal human dermal fibroblast cells 
(NHDF) migrate in a mesenchymal 
fashion and were therefore chosen for study. 
We began by seeding NHDF cells on sur- 
faces coated with various concentra- 
tions of pLL-PEG-RGD, but without 
confining them (Figure 2A). As RGD den- 
sity decreased, cell spreading likewise 
decreased. Where RGD density dipped 
below 5%, cells were unable to attach 
to the substrate when gently flushed (Fig- 
ures 2B and 2C). 

To quantify differences in migration behavior, trajectories of in- 
dividual cells (Figure 2D) were analyzed in terms of instantaneous 
speed (S), persistence (P) and diffusivity (D) (Figures 2E, S2D, 
and S2E; Movie S1; see Extended Experimental Procedures 
for details on calculation). In agreement with previous work by 
others (DiMilla et al., 1993), the long-term displacement of cells, 
which is characterized by their diffusion coefficient, is maximal 
for an intermediate level of adhesion (Figure 2F), due to an in- 
crease in persistence and a decrease in instantaneous speed 
with increasing adhesion (Figures 2G and 2H). These observa- 
tions confirmed the typical mesenchymal migration of these 
fibroblasts. 
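
The three trajectory descriptors used above can be illustrated with a short calculation. The following is a minimal sketch, assuming 2D tracks sampled at a fixed interval, a diffusion coefficient taken from the mean squared displacement via MSD = 4Dt, and a persistence time taken from the long-time limit of a persistent random walk (D = S²P/2). These definitions and all function names are assumptions made for illustration; the authors' actual calculation is given in their Extended Experimental Procedures and may differ.

```python
import math


def mean_instantaneous_speed(track, dt):
    """Mean step length divided by the sampling interval dt."""
    steps = [math.dist(a, b) for a, b in zip(track, track[1:])]
    return sum(steps) / (len(steps) * dt)


def mean_squared_displacement(track, lag):
    """Mean squared displacement over all pairs separated by `lag` frames."""
    squared = [math.dist(a, b) ** 2 for a, b in zip(track, track[lag:])]
    return sum(squared) / len(squared)


def diffusion_coefficient(track, dt, lag):
    """Estimate D from MSD = 4 * D * t (2D motion), evaluated at one lag."""
    return mean_squared_displacement(track, lag) / (4.0 * lag * dt)


def persistence_time(track, dt, lag):
    """Estimate P from D = S**2 * P / 2 (assumed persistent random walk).

    Only meaningful when lag * dt is much longer than P.
    """
    speed = mean_instantaneous_speed(track, dt)
    return 2.0 * diffusion_coefficient(track, dt, lag) / speed ** 2


if __name__ == "__main__":
    # Hypothetical track: positions in micrometers, one frame per minute.
    track = [(0.0, 0.0), (1.0, 0.5), (1.5, 1.5), (2.5, 1.0), (3.0, 2.0)]
    print(mean_instantaneous_speed(track, dt=1.0))
    print(diffusion_coefficient(track, dt=1.0, lag=2))
```

Under these assumed definitions, D scales with S²P, so a rise in persistence and a fall in speed with increasing adhesion can produce a maximum in D at intermediate adhesion.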

Confinement Modifies the Migration Behavior of 
NHDF Cells 

We introduced confinement by covering cells (cultured on 
surfaces of controlled adhesion) with a non-adhesive surface 
(Figure S1). NHDF cells were 7.6 ± 0.36 μm high during interphase. 
We therefore chose to limit our investigation to heights 
of either 5 or 3 μm. At 5 μm (low confinement), the cell body 
was slightly deformed. At 3 μm (high confinement), both the 
cell body and the nucleus were deformed (Le Berre et al., 
2012). 

Cells tended to retract when confined, forming fewer large la- 
mellipodial protrusions and more elongated pseudopodia (Movie 
S2; Figure 3A for 5 μm, Figure S3A for 3 μm, and phenotype C in 
Figures 3B-3E). This retraction of the cell margin was probably 
due to an increase in cell cortex contractility, which favors a 
more compact morphology (Sahai and Marshall, 2003). When 
adhesion was low (<15% RGD) and confinement high, cells 
became totally round, mostly immobile, and often showed a 
continuous blebbing activity (Movie S2; phenotype B in Figures 
3B-3E, note in Figure S3 legend), further suggesting an 
increased level of contractility. Unexpectedly, a fraction of these 
rounded NHDF cells could polarize and move fast with an 
amoeboid-like morphology (Movie S2; phenotype A in Figures 3B-3E). 

Further quantification of the migration of spread cells (phenotype 
C or E) showed that confinement always reduced the 
instantaneous speed of such cells (Figure 3E), but did not 
significantly change their persistence (Figure S3B). Cells that were 
more rounded did not move, except for a sub-fraction of cells 
that spontaneously polarized (phenotype A). These cells showed 
a much higher diffusion coefficient (up to ten times higher), which 
was not observed under conditions of strong adhesion or without 
confinement. The high diffusion coefficient of cells showing an 
amoeboid-like behavior could be attributed to both an increase 
in instantaneous speed and an increase in persistence. This is 
reminiscent of the behavior of leukocytes, such as dendritic 
cells, which are known to migrate fast and independently of 
specific integrin-based adhesion (Lammermann et al., 2008; 
Lammermann and Sixt, 2009; Hawkins et al., 2009; Heuze et al., 
2013). 

Figure 2. Migration Behavior of NHDF Cells on Surfaces of Various Adhesive Levels 

(A) Representative phase contrast images of NHDF cells on surfaces containing various RGD densities. 

(B and C) Quantification of the number of attached cells per mm² (B) and their spreading areas (C) for various RGD densities (n > 20 cells, N > 2 experiments for 
each point on the graph). 

(D) Representative trajectories of 60 cells recorded over 24 hr for various RGD densities. The starting positions of each cell were registered to the center of the plot. 

(E) Schematic explaining measurements of cell displacement D(t), mean instantaneous speed (S), and persistence (P) of cells (see Supplemental Information). 

(F-H) Graphs showing diffusion coefficient (F), mean instantaneous speed (G), and persistence time (H) of NHDF cells for various RGD densities (n > 100 cells for 
each point on the graph). In (A)-(H), % corresponds to the percentage of pLL-PEG-RGD in both pLL-PEG and pLL-PEG-RGD solution, error bars indicate SEM, 
and scale bars represent 100 μm. 

See also Figure S2 and Movie S1. 

Figure 3. Migration Behavior of NHDF Cells in Response to Varying Confinement Height and Adhesion Strength 

(A) Representative phase contrast images of NHDF cells under 5 μm confinement and various RGD densities. Scale bar represents 100 μm. 

(B) Schematic showing the appearance of the observed phenotypes as a function of confinement height and adhesion strength. Phenotypes are defined as 
follows: A, round cell with a leading edge; B, round cell; C, cell with long and non-spread pseudopodia; D, floating, round cell; E, normally spread cell. 

(C) Proportion of the different phenotypes observed under various conditions according to cell morphologies defined in (B). n > 100 cells for each condition. 

(D) Top panel: representative images of cells for each phenotype. Conditions of confinement (μm)/adhesion (% pLL-PEG-RGD) are A: 5/2%; B: 3/15%; C: 3/75%; 
E: no/100%. Bottom panel: corresponding time lapse of the cell outline. The color of the outlines represents elapsed time after the first outline. Scale bar 
represents 50 μm. 

(E) Graph showing diffusion coefficient (top) and instantaneous speed (bottom) of each NHDF phenotype on varying RGD densities and for different confine- 
ments. Error bars indicate SEM. 

See also Figure S3 and Movie S2. 
Altogether, these observations show that a fraction of typically 
mesenchymal NHDF cells migrating under conditions of high 
confinement and low adhesion can switch to a fast amoeboid- 
like migration. 

Two Distinct Modes of Amoeboid-like Migration Can Be 
Induced, Depending on Cell Type 

To test whether this microenvironment-dependent switch in the 
mode of migration was restricted to NHDF cells, we investigated 
the migration behavior of 20 other cell types confined by fully 
non-adhesive surfaces (100% pLL-PEG). We chose a sample 
of cell types covering a large range of migratory and non-migra- 
tory cells, including normal and transformed epithelial cells (pre- 
and post-EMT), normal and transformed mesenchymal cells, 
leukocytes (typical amoeboid cells), and muscle cell precursors. 

For all cell types, with the exception of normal epithelial cells, 
we found a sub-fraction of spontaneously polarizing cells moving 
fast with an amoeboid-like morphology (Figures 4A and 4B). Un- 
expectedly, we found two distinct types of fast-migrating cells 
(Figures 4A and S4A-S4C). The first type, which we named A1, 
was similar to what was observed with NHDF cells: a round 
cell body and a small leading edge (Movie S3). The second, 
which we named A2, had an elongated ellipsoid cell body with 
a large uropod and resembled a migrating neutrophil (Figures 
S4A and S4B; Movie S3). Globally, A2 cells moved significantly 
faster (5.3 ± 1.5 µm/min) than A1 cells (1.7 ± 0.4 µm/min), and 
both moved much faster than mesenchymal controls 
(0.234 ± 0.09 µm/min) (see Figures 4C and 4D). 

The type of fast migration, A1 or A2, varied according to the cell 
line under observation (Figure 4D). For five cell types, the two 
modes could coexist in the same cell population, while most 
cell types preferentially switched to one migration mode or the 
other. Interestingly, different sub-clones of HeLa cells obtained 
from different labs (see Supplemental Information for a descrip- 
tion of cell types) could either preferentially switch to A1 or A2 
(Figure 4D). This suggests that these two migration modes, which 
are morphologically very different, can arise in very similar cell 
types, probably based on subtle molecular differences. 

Transformed Cells, as well as Leukocytes, Preferentially 
Display Fast A2 Migration under Confinement 

Comparing different groups of cell types revealed general trends 
in the transition to amoeboid migration. Normal epithelial cells 
rarely displayed a transition to fast amoeboid migration (Fig- 
ure 4D). This was due to the formation of large cell clusters 
that were mostly non-motile. At lower density, individual cells 
were able to display the A1 migratory phenotype. 

When we induced EMT in MDCK or MCF10A cells with TGFβ, 
they no longer formed clusters and displayed the fast A2 migratory 
phenotype. This was also true for tumor cell lines of epithelial 
origin from different tissues or in vitro transformed cell lines. 
Similarly, while normal mesenchymal cells tended to display 
mostly A1 migration under confinement, transformed mesen- 
chymal cells often used in MAT studies displayed a high propor- 
tion of A2 migration mode when confined in a non-adhesive 
environment. Surprisingly, even muscle cell precursors dis- 
played this fast migratory behavior under strong confinement. 
As expected, all leukocytes displayed A2-like migration, 
including monocytes, which showed no motility on 2D substrates 
due to lack of adhesion. This suggested that the fast 
A2 migration mode was typically adopted by amoeboid cells, 
such as leukocytes, and by transformed cells migrating individ- 
ually, independently of their tissue of origin. 

Altogether, these results show that most slow mesenchymal 
cells can spontaneously switch to a fast amoeboid-like mode 
of migration when they are confined in a non-adhesive 
environment. 

The Switch to Amoeboid-like Migration Depends on the 
Absence of Focal Adhesions 

The switch to amoeboid migration was observed only for adhe- 
sion levels that did not allow cell binding to the substrate when 
no confinement was applied (Figures 2B and 3C). To check for 
the presence of focal adhesions, we imaged HeLa VL cells by total 
internal reflection fluorescence microscopy (TIRFM) (Trache and Meininger, 
2008) on fibronectin-coated substrates and on PEG-treated 
substrates. On fibronectin, Vinculin-GFP displayed the expected 
pattern of focal adhesions, while on PEG, it was diffuse in the 
cytoplasm (Figure S5A). This was true both with and without 
confinement, showing that confinement did not restore the ca- 
pacity of cells to form focal adhesions on PEG-treated surfaces. 

We then tested whether the ability of PEG-treated substrates 
to promote amoeboid migration relied on a lack of focal adhe- 
sions or on another surface effect of the PEG treatment. For 
this, we confined HeLa VL on an adhesive fibronectin surface 
and prevented formation of focal adhesions by knocking down 
the expression of Talin proteins with small interfering RNA 
(siRNA) or using the β1-integrin-blocking antibody 4B4 (Figures 
5, S5B, and S5F). 

Under confinement on a fibronectin-coated surface, Talin-knockdown 
cells switched to A2 three times more often 
than cells treated with a non-targeting siRNA and as often as 
control cells on a non-adhesive PEG-treated substrate. Similarly, 
treating cells with 4B4 significantly increased the proportion of 
cells switching to amoeboid migration. This confirmed that the 
capacity to switch to amoeboid-like migration depended on 
the absence of focal adhesions and not on the chemical nature 
of the migration substrate or on a particular molecular perturba- 
tion of focal adhesion complexes. 

The Switch to Fast A2 Amoeboid-like Migration Depends 
on High Cell Contractility, whereas A1 Requires Low 
Contractility 

The comparison of a large number of cell types under two 
different levels of confinement suggested that increasing 
confinement led to a larger proportion of A2 migrating cells (Fig- 
ure 4D). We confirmed this effect using three different levels of 
confinement for HeLa VL cells. While most migrating cells 
displayed A1 migration under low confinement, only A2 was 
observed under high confinement (Figure 5B). 

To investigate the role of the myosin II motor protein in the 
behavior of confined cells, we first treated confined HeLa VL 
cells with increasing doses of the phosphatase inhibitor calyculin 
A. This inhibits the dephosphorylation of myosin II, thereby 
increasing its activity (Ishihara et al., 1989). Increasing calyculin 
A concentration increased the fraction of cells switching to A2 
migration mode, up to 66% of the total population (Figures 5C 
and 5D; Movie S4). We also used siRNA to knock down the 
expression of MYPT1 (Figure S5F), the PP1 partner targeting 
myosin II (Feng et al., 1999). MYPT1 siRNA induced an increase 
in the volume and number of blebs (Figure S5C) and an increase 
in the fraction of cells switching to A2 (Figures 5E and S5D), 
confirming the role of myosin II-based contractility in this 
phenomenon. 

High-resolution imaging showed that the transition to A2 cor- 
responded to the formation of a large bleb where most of the cell 
cytoplasm, including the nucleus, was transferred (Figure 5F; 
Movie S5), inducing a profound reorganization of the cytoskel- 
eton and resulting in a new stable shape with an elliptical cell 
body and a rear uropod. The formation of such large blebs 
occurred at a constant rate (Figure S5E) and more frequently 
when cells were treated with calyculin A or knocked-down for 
MYPT1 (Figure S5E). Overall, this suggests that confinement 
favors cell contractility and that high cell contractility favors 
fast A2 migration. 

To confirm that hypothesis, we measured the contractility of 
cell populations displaying different fractions of A2 phenotype 
(see Supplemental Information). We plated cells between two 
non-adhesive gels of known rigidity (1 kPa) coated with fluores- 
cent beads (Figure S5H). Using 3D high-resolution imaging, we 
assessed the extent to which the cells deformed the gels and 
so deduced the ability of cells to push on their surroundings, 
as a measure of their relative level of cortical contractility (Fig- 
ure S5I). This measure increased with increasing myosin II activ- 
ity (Figures 5H and S5G), and stronger cells displayed a higher 
fraction of A2 migratory phenotype (Figure 5H). We then assayed 
other cell types and found that cells displaying A2 migration were 
significantly stronger than cells displaying A1 migration (Figure 5I). 
This confirmed our hypothesis that A2 migration appeared 
above a certain threshold of contractility. 

Accordingly, treating cells with the ROCK inhibitor Y27632 
reduced bleb formation and the proportion of cells switching to 
A2. More generally, treatment with blebbistatin prevented formation 
of A2 migrating cells in all cell types tested. Unexpectedly, 
however, it did not prevent HeLa cells from moving and instead 
drastically increased the number of cells switching to the A1 
amoeboid-like migration mode (Figures 5C and 5G; Movie S4). 

Together, these experiments show that high contractility 
favored fast A2 migration mode, by increasing blebbing activity, 
eventually leading to the formation of large “stable blebs” 
comprising most of the cell content. Conversely, low contractility 
favored the formation of protrusive structures, leading to A1 
migration. 



A2 Migration Mode Relies on a Fast and Global Cortical 
Retrograde Flow, whereas A1 Migration Relies on Local 
Actin Polymerization 

To characterize the migration mechanism of confined non-adhe- 
sive cells, we imaged focal adhesions, actin, and myosin II using 
TIRFM. On the fibronectin-coated surface, cells attached, 
spread, and formed focal adhesions and actin stress fibers con- 
taining myosin II (Figure 6A, upper panel; Movie S6). In A2 cells 
migrating under confinement on a non-adhesive surface, Vincu- 
lin-GFP was homogeneously distributed in the cytoplasm, and 
no stress fibers could be observed. Instead, actin filaments 
assembled into a fibrous cortex that was absent from the cell 
front, but became progressively denser toward the cell rear at 
the uropod (Figure 6A, middle panel; Movie S6). Myosin II formed 
patches in A2 cells, mostly on the rear part of the cell where the 
actin cortex was denser. Both actin filaments and myosin II 
patches showed a strong retrograde flow in the central part of 
the cell (15.8 ± 3.7 µm/min, SD, n = 11, measured in the reference 
frame of the cell), whereas almost no actin movement was 
observed in the rear of the cell and in the uropod (Figure 6A, bottom 
panel; see also Movie S7 and Figure 6B for flow visualization 
by particle image velocimetry). 

To assess whether A1 motility relied on a similar mechanism, 
we added 30 µM of the ROCK inhibitor Y27632. As with the A2 
cells, we found that motile single A1 cells displayed a fast retrograde 
flow (14.63 ± 0.9 µm/min). Unlike the A2 cells, however, 
the flow was not global, from the front to the back of the cell. 
Instead, it was localized only in a small protruding region at the 
leading edge of the cell (Figures S6A and S6B; Movie S8). The 
limited area of the retrograde flow might explain why these cells 
were slower than A2 cells. 

Several studies have shown that such a retrograde flow is 
responsible for cell movement independently of adhesion in 
amoeboid cells (Bergert et al., 2012; Renkawitz et al., 2009). 

To confirm the role of the retrograde flow in cell movement, we 
also compared retrograde flow speed with cell speed and found 
a significant correlation between these two parameters (Fig- 
ure S6E). However, retrograde flow was faster than the cell 
speed, indicating that a limited amount of friction occurs be- 
tween the flowing cortex and the substrate surface (Renkawitz 
et al., 2009). In A2 cells lacking a uropod, the cell could go as 
fast as the retrograde flow (Figure S6F; Movie S8), and on 
average, A2 cells were 1.84 times faster (1.32-2.58 at 95% confidence) 
after being detached from their uropod (see also Figures 
S6C and S6D). These results confirmed that A2 cells moved via a 
coupling of a fast acto-myosin retrograde flow with the substrate 
and that the uropod acted as a dragged passive body. 
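
The relation between retrograde flow speed, cell speed, and uropod drag described above can be illustrated with a toy one-dimensional force balance. This is only a sketch under stated assumptions, not the model used in the study: it assumes linear friction between the flowing cortex and the substrate, treats the uropod as a passive body with its own linear drag, and uses hypothetical names (`cell_speed`, `body_friction`, `uropod_drag`, `drag_ratio_from_speedup`). The only value taken from the text is the 1.84-fold speed-up after uropod detachment.

```python
def cell_speed(flow_speed, body_friction, uropod_drag=0.0):
    """Steady-state cell speed from a toy 1D force balance (assumed model).

    Propulsion from the flowing cortex: body_friction * (flow_speed - v_cell)
    Passive drag from the uropod:       uropod_drag * v_cell
    Equating the two and solving for v_cell gives the expression returned.
    """
    if body_friction <= 0:
        raise ValueError("body_friction must be positive")
    if uropod_drag < 0:
        raise ValueError("uropod_drag must be non-negative")
    return flow_speed * body_friction / (body_friction + uropod_drag)


def drag_ratio_from_speedup(speedup):
    """Uropod-to-body friction ratio implied by a speed-up on uropod loss.

    In this toy model, v_without / v_with = 1 + uropod_drag / body_friction.
    """
    return speedup - 1.0


# Flow speed in arbitrary units; friction coefficients are relative values.
ratio = drag_ratio_from_speedup(1.84)  # 1.84-fold speed-up reported in the text
with_uropod = cell_speed(1.0, body_friction=1.0, uropod_drag=ratio)
without_uropod = cell_speed(1.0, body_friction=1.0)
print(f"implied uropod/body friction ratio: {ratio:.2f}")
print(f"cell speed / flow speed with uropod: {with_uropod:.2f}")
print(f"cell speed / flow speed without uropod: {without_uropod:.2f}")
```

In this sketch a cell without a uropod moves exactly as fast as its retrograde flow, and any passive drag makes the cell slower than the flow, which is the qualitative behavior reported above. The linear-friction form itself is an assumption and is not derived from the data.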



Figure 4. Strong Confinement in a Non-Adhesive Environment Can Induce Two Distinct Modes of Amoeboid Migration 

(A) Time-lapse sequences of a typical amoeboid RPE1 (left) and HeLa VL (right) cell under 3-µm confinement on a non-adhesive PEG surface. Grey is phase 
contrast and blue is nuclear staining (Hoechst). Time is in min ('). Scale bar represents 20 µm. 

(B) Fraction of each phenotype for various cell lines under 3-µm confinement on a non-adhesive surface (n = 140, 147, 267, 129, 60, 138, 137, 115, and 112 cells). 

(C) Instantaneous migration speed for various cell lines depending on their migratory phenotype (n = 13, 13, 13, 13, 13, 13, 22, 6, 10, 11, 10, 2, 9, 3, 10, 46, and 11). 
*p < 0.05, **p < 0.01 according to t test. 

(D) Comprehensive table indicating the presence or absence of amoeboid phenotypes for various cell lines: blue indicates RPE1-like (A1), red indicates HeLa-like 
(A2), purple indicates both (A1+A2), and gray indicates none. Cells are sorted by their origins and state of transformation. Species are human (h), mouse (m), and 
dog (d). Origins are primary (p), immortalized (i), and tumoral (t). 

See also Figure S4 and Movie S3. 
In A2 Cells, the Stable and Global Retrograde Flow Is 
Induced by Myosin II Contractility and Asymmetrical 
Actin Turnover 

To better understand how global retrograde flow could be main- 
tained and how actin was recycled in the cell, we calculated 
the polymerization dynamics of filamentous actin. We first 
measured the 1D retrograde flow of actin and myosin II along 
the length of cells (see Extended Experimental Procedures for 
details). In individual cells, the mean rate of flow (in the reference 
frame of the moving cell) was similar for both proteins (Figure 6C). 
In terms of protein localization, however, myosin II patches ap- 
peared to lie further toward the back of the cell (Figure 6D; Movie 
S6). Interestingly, we noticed that myosin II was localized where 
the actin retrograde flow accelerated (Figure 6D, bottom-right 
graph) and that the gradient of speed of the actin filaments, which 
accounts for the actin network contraction, was similar to the 
mean myosin II density along the cell (Figure 6D). This suggested 
a role for myosin II in accelerating the retrograde actin flow. 

To produce a stable cortical flow, cortical actin has to be re- 
cycled. According to the well-accepted model for actin filament 
turnover (Wilson et al., 2010), new actin monomers are polymer- 
ized at the cell front and filamentous actin is severed at the back 
of the cell. As in Wilson et al. (2010), we were able to use the mass 
conservation law to deduce the rate of polymerization of filamen- 
tous actin along the cell length (see Supplemental Information for 
details). We found that actin polymerized at the front at a constant 
rate and depolymerized at the back (Figure 6E). A similar analysis 
of myosin II dynamics showed that recruitment of this protein was 
reduced toward the cell front, suggesting that it was rapidly 
captured by actin filaments and moved toward the rear of the cell. 
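
The mass-conservation argument above can be sketched numerically. At steady state in the reference frame of the cell, the density ρ(x) and flow velocity v(x) of a cortical component satisfy d(ρv)/dx = s(x), where s is the net local assembly rate (positive for polymerization or recruitment, negative for depolymerization). The following is a minimal illustration under assumptions of our own, not the analysis pipeline of the study: the function name `net_assembly_rate`, the sign convention (x increasing toward the cell front, retrograde flow negative), and the synthetic profile are all hypothetical.

```python
def net_assembly_rate(x, density, velocity):
    """Net assembly rate s(x) = d(density * velocity)/dx at steady state.

    Uses central finite differences in the interior and one-sided
    differences at the two ends. All inputs are equal-length sequences
    sampled along the cell length.
    """
    n = len(x)
    if n < 2 or len(density) != n or len(velocity) != n:
        raise ValueError("x, density and velocity need the same length (>= 2)")
    flux = [d * v for d, v in zip(density, velocity)]
    rate = []
    for i in range(n):
        lo = max(i - 1, 0)
        hi = min(i + 1, n - 1)
        rate.append((flux[hi] - flux[lo]) / (x[hi] - x[lo]))
    return rate


# Synthetic example (assumed values): density decreasing toward the front,
# uniform retrograde (negative) velocity.
x = [float(i) for i in range(11)]       # position, rear (0) to front (10)
density = [2.0 - 0.1 * xi for xi in x]  # arbitrary units
velocity = [-3.0] * len(x)              # retrograde flow, arbitrary units
rates = net_assembly_rate(x, density, velocity)
print(rates[0], rates[5], rates[-1])
```

With a uniform retrograde flow, a density that decreases toward the front implies net assembly along the sampled region, because material must be added to sustain the rearward flux. Real profiles would come from the measured density and flow data, and the differencing scheme here is only one simple choice.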

Based on these elements, a mathematical model can be proposed 
(summarized in Figure 6H and Supplemental Information) 
(see also Ruprecht et al. [2015] in this issue of Cell; Callan-Jones 
and Voituriez, 2013; Hawkins et al., 2011), which predicts the 
appearance of a motile “stable bleb” state above a threshold 
of contractility of the acto-myosin cortex. 

To further test this model, we assessed the respective roles of 
actin turnover and myosin II contractility in A2 migrating cells. 
When cells were treated with low concentrations of jasplakino- 
lide, a drug that interferes with actin filament turnover, an 
increasing fraction of the actin filaments was found to accumu- 
late at the cell back (Figure 6F). Above 250 nM, the proportion 
of A2 cells dropped (Figure 6F). Treated cells also displayed an 
unstable and floppy front, showing that cortex turnover was 
essential for the stability of the A2 mode. To test the role of 
myosin II during steady-state migration, we introduced the 
drug into the device after the A2 cells had formed. After treat- 
ment, cells concomitantly lost their elongated shape to become 
round and stopped moving (Figure 6G). Myosin II activity was 
thus not only required for spontaneous polarization, but also, 
together with actin filament turnover, to maintain the cortical 
flow, the shape, and the motility of the A2 mode. 

All these observations are consistent with our model and are 
identical to the observations made on embryonic fish cells 
reported by Ruprecht et al. (2015) in this issue of Cell. This 
suggests a common generic mechanism for the migration of 
contractile confined cells which are not able to form acto- 
myosin stress fibers and instead form a contractile cortex. 
When a cell exceeds a certain level of contractile activity, its cor- 
tex can be destabilized by blebs and switch spontaneously to 
another stable state that induces a fast migration behavior (Figure 6I). 
Conversely, when a cell is only moderately contractile, its 
cortex is constantly being destabilized by blebs but does not 
switch to a new stable configuration, leading to non-migrating, 
blebbing cells. When contractility is completely inhibited, the 
cortex is not destabilized, and a cell can develop more protrusive 
activity, which eventually leads to the A1-type amoeboid-like 
migration. 



Figure 5. Migratory Transitions Depend on Myosin II Activity and on the Presence of Focal Adhesions 

(A) Left panel: representative images of HeLa VL on a fibronectin-treated substrate after treatments against adhesion: by Talins depletion (TLN1+TLN2) 
with siRNA compared with a non-targeting control (NTC) or by blocking β1 integrins with the 4B4 antibody (30 µg/ml). Right panel: fraction of A2 cells 
after confinement on a non-adhesive surface (PEG) or a fibronectin-coated surface (Fn) with or without treatments against focal adhesions (n = 482, 313, 687, 
240, 424, and 436 cells). Scale bar represents 50 µm. All confined cells are under 3-µm confinement and error bars are either SEM for means or error margin 
for %. 

(B) Fraction of each phenotype for HeLa VL cells depending on confinement level on a non-adhesive PEG surface (n = 308, 254, and 140 cells). 

(C) Representative images of HeLa VL cells without and with 100 nM calyculin A or 30 µM Y27632 treatments under 3-µm confinement on a non-adhesive PEG 
surface. Scale bar represents 50 µm. 

(D) Fraction of A2 cells for various calyculin A concentrations, in HeLa VL cells (n = 140, 327, 208, 140, and 185 cells). 

(E) Fraction of HeLa VL cells transitioning to the A2 phenotype after MYPT1 depletion with three distinct sequences of siRNAs or after a non-targeting control 
(NTC) treatment (n = 664, 637, 408, 211, and 177 cells). 

(F) Time-lapse sequences of a HeLa Kyoto cell showing transition from a rounded blebbing cell (phenotype B) to A2 phenotype. Images are an overlay of phase 
contrast (gray), MYH9-GFP fluorescence (green, staining myosin II), and Hoechst nuclear staining (red). Time is in hr:min. Scale bar represents 25 µm. 

(G) Fraction of HeLa VL cells transitioning to A1 and A2 phenotypes after 30 µM Y27632 treatment (n = 140 and 191 cells). 

(H) For various HeLa clones or after treatment with Y27632 (30 µM) or calyculin A (50 nM), fraction of A2 cells compared with the mean contractility index of cells 
<h/d>, which quantifies the ability of detached cells to deform a soft non-adhesive polyacrylamide (PA) gel of 1 kPa. For each point: n > 200 cells on y axis and 
n > 15 on x axis. 

(I) Contractility index of various cell types showing either A1 or A2 migration (n = 19, 22, 18, 20, 17, 19, 18, 19, 17, 18, 17, and 15; p values are from a 
non-parametric Mann-Whitney test). 

See also Figure S5 and Movies S4 and S5. 
Figure 6. Motility of A2 Fast-Moving Phenotype Is Based on a Strong Actin Retrograde Flow 

(A) Representative TIRF microscopy images of HeLa cells expressing both Vinculin-GFP (left) and LifeAct-mCherry (right) (HeLa VL), and HeLa cells expressing 
MYH9-GFP (middle) under the following two conditions: on fibronectin-coated adhesive surface without confinement (top panel) and confined under 3 µm on a 
non-adhesive PEG surface (middle panel). Bottom images correspond to kymographs showing retrograde flow in the cell body. Orange dashed lines show cell 
outlines, red dashed lines were used to generate kymographs. Horizontal bars are 10 µm and vertical bars are 2 min. 

(legend continued on next page) 
DISCUSSION 

A Phase Diagram of Cell Migration Can Be Drawn Based 
on Only Three Parameters 

Amoeboid migration has been described as a fast migration 
mode potentially independent of adhesion and involving a 
rounded cell shape. However, no consensus on its mechanism 
has been established. Here, we showed that physical confine- 
ment and low adhesiveness could promote a mesenchymal-to- 
amoeboid transition across a range of cell types that have not 
been thought to be capable of amoeboid motility. In addition, 
cortical contractility determined the type of amoeboid migration, 
leading to a general phase diagram for single cell locomotion, 
based on only three main parameters: adhesion, confinement, 
and contractility (Figure 7B). 
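
As a reading aid, the qualitative phase diagram can be written as a small lookup. This sketch only restates the trends reported in the Results; it is not a quantitative model. The function name `migration_mode`, the three contractility labels, and the returned strings are assumptions made for illustration, and the text gives no numeric thresholds for any of the three parameters.

```python
def migration_mode(focal_adhesions, confined, contractility):
    """Qualitative migration mode from the three parameters (illustrative).

    focal_adhesions: True if the cell can form focal adhesions.
    confined:        True if the cell is physically confined.
    contractility:   "low" (e.g., inhibited myosin II), "moderate", or "high".
    """
    levels = ("low", "moderate", "high")
    if contractility not in levels:
        raise ValueError(f"contractility must be one of {levels}")
    if focal_adhesions:
        # Stress fibers and persistent protrusions: slow mesenchymal mode.
        return "mesenchymal"
    if not confined:
        # Without adhesion or confinement, forces cannot be transmitted.
        return "non-migrating (no force transmission)"
    if contractility == "low":
        return "A1"  # local retrograde flow in a protrusive leading edge
    if contractility == "moderate":
        return "blebbing, non-migrating"
    return "A2"  # stable bleb with a global cortical retrograde flow


print(migration_mode(focal_adhesions=True, confined=False, contractility="high"))
print(migration_mode(focal_adhesions=False, confined=True, contractility="low"))
print(migration_mode(focal_adhesions=False, confined=True, contractility="high"))
```

The discrete labels hide the fact that, in the experiments, populations are mixed and only a fraction of cells switch mode under any given condition.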

Below, we discuss in more detail how these three parameters, 
either individually or in conjunction, might contribute to the 
migratory transitions that we observed. 

Figure 7. Schematic of the Proposed Model Describing Transitions between Modes of Cell Migration 

(A) Increasing cortical tension and decreasing cortex-surface interaction both converge toward a round, “non-wetting” shape of the cell. 

(B) Summary of experimental observations. 

(C) Generic phase diagram resulting from models of cortical wetting and cortical stability. 

The Absence of Focal Adhesion Allows the Formation of 
a Cell Cortex and a Switch to a Rounded State 

An intriguing aspect of the transition to amoeboid migration 
modes is that, in all cases, it required the absence of focal adhesions. 
We found that when cells were 
still able to form focal adhesions, they 
formed persistent protrusions and 
stress fibers, all of which prevented 
the formation of large-scale cortical 
flows. This observation is consistent 
with previous reports that integrin 
expression inhibits amoeboid migration 
(Carragher et al., 2006). Focal adhesions 
and the cell cortex might compete to re- 
cruit the acto-myosin contractile machinery, leading to either 
contractile stress fibers or a contractile/flowing cortex. In the 
absence of focal adhesions, cortical tension can trigger the 
mechanical delamination of remaining adhesive sites, leading 
to a rounded shape. To mathematically account for this tran- 
sition from a spread, adherent phenotype to a round shape at 
low adhesion and high contractility, we can draw an analogy 
with the dewetting of liquid droplets on flat surfaces (see Sup- 
plemental Information). This qualitatively accounts for the 
observed transition from phenotype E to B in Figure 7. This 
analogy, although speculative at this stage, is useful to complete 
the theoretical framework recapitulating our observations. 
It captures the importance of the balance between 
contractility and adhesion. 

Importantly, adhesion alone does not prevent the transition to 
amoeboid migration, but rather its association with the formation 
of stress fibers. As a consequence, cell types that could adhere 
to particular matrix components without generating stress fibers 
could also move with a flowing cortex, leading to amoeboid 
migration even in the presence of a strong coupling between 
the actin cortex and the cell environment. This type of motility 
has been observed in leukocytes, in cancer cells invading a 
soft non-crosslinked matrix such as Matrigel (Poincloux et al., 
2011), and in embryonic cells from zebrafish (Ruprecht et al. 
[2015] in this issue of Cell). 

(B) Particle image velocimetry (PIV) of actin flows (bottom image) recorded on a TIRF movie (top image). Dashed part corresponds to areas where signal was too weak 
for flow calculation. Scale bar represents 10 µm. 

(C) Mean retrograde flow along the cell (in the reference frame of the cell). Right images show speed distribution measured for 15 min on an individual cell and left 
graph shows mean flow averaged over several cells; x = 0 corresponds to the position of the neck. 

(D) Mean actin (based on LifeAct staining) and myosin II density profiles. Right images show 2D mean densities in the cell body averaged for 15 min on a single cell 
and left graph shows mean density profile for several cells. Myosin II density colocalized with the actin retrograde flow gradient grad(v). 

(E) Polymerization rate estimates along the cell body for myosin II and actin, inferred from density and retrograde flow profiles shown in (C) and (D). 

(F) Left: actin localization (inverted image of LifeAct fluorescence) in a HeLa VL A2 cell with or without jasplakinolide treatment. Right: fraction of A2 cells as a function of 
jasplakinolide concentration (for each bar n > 278 cells, error bars are error margins). Scale bar represents 20 µm. 

(G) Time-lapse sequence of an A2 HeLa VL cell after a 150 µM blebbistatin treatment (left) and quantification of the aspect ratio of the cell front and cell speed 
before and 60 min after treatment (n = 7 cells, error bars are SEM) (right). 3-µm confinement, PEG. Scale bar represents 50 µm. 

(H) Schematic of the proposed model for cell motion (see main text and Supplemental Information). 

(I) Mean cell shape: superposition of the outlines of 22 cells showing the reproducibility and stability of cell shape in mode A2. Scale bar represents 10 µm. In 
(C)-(E), n = 5 (actin) and 6 (myosin II) cells, error envelope is SEM, and 0 on x axis corresponds to the neck position (point a in [A]). 

See also Figure S6 and Movies S6, S7, and S8. 

Confinement Is Required for Transmission of Forces in 
the Absence of Adhesion, but It Also Increases 
Contractility 

When mesenchymal cells are prevented from forming focal ad- 
hesions, they cannot hold on to fibronectin- or collagen-coated 
substrates. This automatically makes confinement a require- 
ment for cell migration. This phenomenon has been termed 
“chimneying” (Malawista et al., 2000), in reference to a tech- 
nique used by mountain climbers. Exactly how these frictional 
forces are transmitted from the acto-myosin cortex to the migra- 
tion substratum remains an open question. In addition, it has 
been proposed that confinement, in more complex geometries, 
combined with blebbing activity and an intrinsic cell polarity, 
could allow complete non-frictional motion to occur (Tozluoglu 
et al., 2013). In conclusion, the most direct function of confine- 
ment is to enable force transmission in the absence of adhesions 
to the substrate. 

An unexpected effect of confinement was to significantly in- 
crease contractility, thus favoring blebbing and the A2 mode of 
migration. As this effect was immediate, it could not rely on the 
activation of a transcriptional program. Force-induced signaling 
via focal adhesions could not explain this effect either, as cells 
did not form adhesive structures. This suggests that the increase 
in contractility observed upon confinement might be induced 
directly by cell and/or nuclear deformation through a yet- 
unknown mechanism. 

A Major Difference between A1 and A2 Migration Modes 
Is the Origin of the Polarization Process that Triggers 
Cell Movement 

An intrinsic polarization factor or process in the cell, or an 
extrinsic spatial cue can induce a local retrograde flow in a pro- 
trusive structure (Lorentzen et al., 2011). Indeed, we observed 
that cells switched to the A1 migration mode after making contact 
with cell debris or with other cells, or after a division. This 
indicated that this mode of migration only rarely appeared spon- 
taneously and relied on a polarization factor to locally trigger the 
retrograde flow and thus the cell movement. 

Cell polarity can also be induced in the absence of a specific 
polarization signal, a phenomenon previously described in other 
studies (Lorentzen et al., 2011; Poincloux et al., 2011). We found 
that this spontaneous polarization of a contractile cell cortex, 
which is predicted by our minimal model based on active 
gel theory (see Supplemental Information), can be nucleated 
randomly by the blebbing activity (see also Ruprecht et al. 
[2015] in this issue of Cell). Following destabilization of the cortex 
and transfer of the cytoplasm to the large bleb, contractility 
remains concentrated at the back of the cell due to the global 
flow, generating a stabilizing feedback. This can trigger an 
increase in pressure in the cell, a phenomenon reminiscent of 
lobopodial migration (Petrie et al., 2012, 2014). The movement 
of the global actin retrograde flow can also be transmitted to 
the substratum. 

This phenomenon of a steady-state flow based on actin filament 
turnover is essentially similar to the mechanism driving protrusive 
lamellipodial migration (Wilson et al., 2013) or A1 cell 
motility. But, importantly, while A1 motility requires an additional 
mechanism for polarization, in the case of high contractility (A2 
mode), we provided evidence, both theoretical and experimental, 
that a stable polarization and thus a persistent migration 
can occur spontaneously. 

A2 Migration: A Generic, Ancestral Mode of Cell 
Locomotion? 

For all cell lines studied, the speed of amoeboid-like migration 
was much faster than the speed of the mesenchymal form and 
corresponded to values reported for amoeboid cell types 
(several µm/min). This suggests that these typical speeds are 
linked to the mode of migration rather than to the type of cell, 
as a given cell type can adopt both types of migration and their 
associated speeds, depending on its migration environment. 

While we could observe two different types of fast amoeboid 
migration, the A2 mode appeared to be more general. Most 
cell types were able to adopt it, if contractile and sufficiently 
confined. Strikingly, all cell lines displayed a unique shape and 
speed when in the A2 mode, resembling the shape of a migrating 
neutrophil, with a pronounced uropod and a smooth rounded 
leading edge and with a similar range of speed. This was inde- 
pendent of their tissue of origin, of their physiological function, 
and of the shape and speed they displayed when migrating on 
an adhesive substrate without confinement. Importantly, embry- 
onic cells from zebrafish could also display the same general 
shape and range of speed (Ruprecht et al. [2015] in this issue 
of Cell). 

When we investigated the acto-myosin dynamics underlying 
the migration of these cells in more detail, the similarity appeared 
to be even more striking. Interestingly, the stereotypical 
“sausage” shape of the A2/stable bleb migrating cells has also 
been reported for amoebae, along with a similar actin and myosin 
II distribution (Yoshida and Inouye, 2001). This suggests that the 
transition to the A2, or “stable-bleb” fast amoeboid-like migra- 
tion might be a very general phenomenon, relevant not only for 
cultured cancer cells but also for primary embryonic cells, both 
in vitro and in vivo. 

From a mathematical point of view, the transition to A2 
migration mode can be understood with a very generic minimal 
1D theoretical model. The fact that such a simple and 
generic model can predict the transition to A2/stable bleb 
migration suggests that this mode of locomotion could be 
viewed as a basic property of a dynamic and contractile actin 
cortex. Consistent with this view, we often observed cell frag- 
ments of various sizes, resulting from shedding of large blebs, 
moving with the same shape, speed, and actin dynamics as 
A2 cells. 

Taken together, the generality of the A2/stable bleb migration 
mode, the strong conservation of its main features (cell shape, 
cell speed, acto-myosin distribution, and dynamics) across a 
variety of cell types and organisms ranging from amoebae to 
vertebrates, the generic aspect of the model describing it, 
and the key feature of spontaneous polarization, make it 
tempting to speculate that this mode of migration may be an 
ancestral locomotion mechanism for crawling cells, shared by 
eukaryotes. 
A2 Migration Mode Is Related to Cell Transformation

An interesting result emerging from the comparison of various
cell types (Figure 4D) is that the capacity to display amoeboid
motility seems to correlate with the transformed character of 
the cells assayed. This suggests an increased migration plas- 
ticity of cancer cells. 

Importantly, the various migration modes can be observed in a 
single cell line by slightly varying the confinement level, adhesion, 
and contractility. As a consequence, some physical characteris- 
tics of the tumor environment, such as the confinement imposed 
by the surrounding tissue on the growing tumor, might be enough 
to trigger a migratory switch toward a faster and more invasive 
mode of migration. Such an increase in the migration and invasion 
capacity of tumor cells has been observed using encapsulated 
spheroids as a model for tumor growth under confinement 
(Alessandri et al., 2013). This phenomenon could also explain 
how inhibition of matrix-degrading enzymes such as MMPs could 
directly promote a switch to amoeboid migration by increasing 
the confinement of the growing tumor and/or of single migrating 
cells (Wolf et al., 2003). In conclusion, our work demonstrates 
an unexpected plasticity of migration of transformed mesen- 
chymal-like cells when they are not in their physiological physical 
environment, suggesting that tumor cells could spontaneously 
adopt a large variety of strategies to escape primary tumors and 
invade tissues, without any specific genetic alteration. 

EXPERIMENTAL PROCEDURES 

In brief (see Supplemental Information for a detailed description of the 
methods), cell culture and experiments were performed in complete cell culture 
media adapted for each particular cell type. Drugs and RNAi treatments were 
performed following standard procedures. Cell confinement was performed 
using a home-made device (Le Berre et al., 2012) consisting of a suction cup 
made in polydimethylsiloxane (PDMS, RTV615, GE) used to press a confining 
coverslip bearing PDMS micro-spacers on top of the culture substrate plated
with cells. Alternatively, a version of the cell confiner adapted to multi-well 
plates was used to perform multiple experiments in parallel. The molds for 
the PDMS micro-spacers were fabricated following standard photolithography 
procedures. The surface of the confining side was always treated with non- 
adhesive pLL-PEG (SuSoS), while the cell culture substrate was treated to 
obtain various degrees of cell adhesion using mixes of pLL-PEG and pLL-PEG-RGD.
For automated cell tracking, cell nuclei were stained with Hoechst
33342. Image acquisition and image analysis were performed on workstations 
of the PICT-IBiSA Lhomond Imaging facility of Institut Curie. All microscopes 
used for time-lapse recordings were equipped with an on-stage incubation 
chamber which maintained the temperature at 37°C and CO2 concentration
at 5% at all times. Image analysis was performed using ImageJ or MetaMorph
software (Universal Imaging), or homemade routines under MATLAB (Math- 
Works). Analysis of cell trajectories was based on standard methods for anal- 
ysis of persistent random walks. To measure single cell contractility index, cells 
were plated between two poly-acrylamide gels coated with beads and the cell 
dimensions as well as the gel deformation were measured by confocal micro- 
scopy. HeLa VL cells were always used as a reference for gel calibration. 
A minimal 1D theoretical model, based on the observations of the acto-myosin
flow analysis, was adapted from Callan-Jones and Voituriez (2013) and
Hawkins et al. (2011) to predict the mesenchymal-to-amoeboid transition.
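
The trajectory analysis mentioned above can be illustrated with a minimal sketch. The section does not give the authors' actual routines (these were run in ImageJ, MetaMorph, or MATLAB), so the function names `mean_speed` and `msd` and the example track below are hypothetical. The sketch assumes a track is a list of (x, y) positions in micrometers sampled at a fixed time step, and computes the average instantaneous speed and the time-averaged mean-square displacement (MSD), the two quantities that a persistent random walk analysis typically starts from.

```python
import math


def mean_speed(track, dt):
    """Average instantaneous speed of one track.

    track: list of (x, y) positions in micrometers (hypothetical input format).
    dt: time between consecutive positions, in minutes.
    """
    steps = [math.dist(p, q) for p, q in zip(track, track[1:])]
    return sum(steps) / (len(steps) * dt)


def msd(track, max_lag):
    """Time-averaged mean-square displacement for lags 1..max_lag (in frames)."""
    curve = []
    for lag in range(1, max_lag + 1):
        squared = [
            (q[0] - p[0]) ** 2 + (q[1] - p[1]) ** 2
            for p, q in zip(track, track[lag:])
        ]
        curve.append(sum(squared) / len(squared))
    return curve


# Hypothetical, perfectly straight track: 2 um per 0.5-min frame.
track = [(2.0 * i, 0.0) for i in range(10)]
speed = mean_speed(track, dt=0.5)   # um/min
curve = msd(track, max_lag=3)       # um^2 at lags of 1, 2, and 3 frames
```

For a perfectly directed track such as this one, the MSD grows with the square of the lag; a purely random walk would instead give linear growth, and a persistent random walk interpolates between the two regimes.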

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, six 
figures, and eight movies and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2015.01.007.
SUMMARY

3D amoeboid cell migration is central to many devel- 
opmental and disease-related processes such as can- 
cer metastasis. Here, we identify a unique prototypic 
amoeboid cell migration mode in early zebrafish em- 
bryos, termed stable-bleb migration. Stable-bleb cells 
display an invariant polarized balloon-like shape with 
exceptional migration speed and persistence. Progen- 
itor cells can be reversibly transformed into stable- 
bleb cells irrespective of their primary fate and motile 
characteristics by increasing myosin II activity through 
biochemical or mechanical stimuli. Using a combi- 
nation of theory and experiments, we show that, in 
stable-bleb cells, cortical contractility fluctuations 
trigger a stochastic switch into amoeboid motility, 
and a positive feedback between cortical flows and 
gradients in contractility maintains stable-bleb cell 
polarization. We further show that rearward cortical 
flows drive stable-bleb cell migration in various adhe- 
sive and non-adhesive environments, unraveling a 
highly versatile amoeboid migration phenotype. 

INTRODUCTION 

Migrating cells show a versatile repertoire of migration modes 
with remarkable plasticity, allowing them to switch between 
different migration strategies in response to changing environ- 
mental conditions and activation of distinct molecular pathways 
(Friedl and Alexander, 2011). In order to migrate, cells need to 
establish an axis of polarity prior to movement. This polarity ulti- 
mately manifests itself in a polarized architecture of the acto- 
myosin network, which in turn drives cell locomotion through 
different mechanical principles: in mesenchymal migration, the 
cortical actomyosin network facilitates unidirectional movement 
via polarized actin polymerization at the leading edge, combined 
with myosin-based contraction at the cell rear to disassemble 
adhesion sites. Amoeboid cells, in contrast, show heterogeneous
shape and motility characteristics with actin-based
protrusions, such as lamellipodia and pseudopodia and contrac- 
tion-mediated protrusions, such as cellular blebs (Lammermann 
and Sixt, 2009). Recent studies have suggested that propulsive 
forces in amoeboid cells are generated by cortical contractility 
and retrograde cortical flows (Blaser et al., 2006; Poincloux 
et al., 2011; Shih and Yamada, 2010), allowing movement even 
in the absence of specific adhesive coupling to the environment 
(Lammermann and Sixt, 2009; Tozluoglu et al., 2013). 

During zebrafish gastrulation, progenitor cells become motile 
and undergo extensive migration to form the ectoderm, meso- 
derm, and endoderm germ layers. While ectodermal progenitors 
assemble in a pseudo-epithelial cell layer, mesodermal and 
endodermal (mesendodermal) progenitor cells display a highly 
motile mesenchymal phenotype with a mixture of lamellipodia 
and bleb-like protrusions (Row et al., 2011). Interfering with the 
ratio of those protrusion types has been shown to lower the direc- 
tionality but not the speed of their migration (Diz-Munoz et al., 
2010). Besides mesendodermal progenitors, primordial germ 
cells (PGCs) also undergo extensive migration during gastrulation 
but nearly exclusively use bleb-like protrusions for their migration 
(Blaser et al., 2006). Although using different protrusion types, 
migration speed and directionality of PGCs and mesendodermal 
progenitors appear surprisingly similar (Blaser et al., 2006; Diz-Munoz
et al., 2010), raising questions as to the choice and benefit
of certain protrusion types over others for the migration of the 
different progenitor cell types during gastrulation. 

Here, we have studied different migration phenotypes during 
zebrafish gastrulation and identified a cortical contractility-medi- 
ated cell-intrinsic motility switch to fast amoeboid migration in 
3D environments, which we termed stable-bleb migration. 

RESULTS 

Identification of Basic Migration Modes in Zebrafish 
Germ Layer Progenitor Cells 

To study the emergence of migration competence in early germ 
layer progenitor cells, we aimed at developing in vitro assays to 
investigate the complex range of migration behaviors observed 
in vivo under controlled conditions with a minimal set of defined 






Cell 160, 673-685, February 12, 2015 ©2015 The Authors








Figure 1. Zebrafish Germ Layer Progenitor Cells Exhibit Distinct Cell Migration Modes In Vitro 

(A) Bright-field (BF) and fluorescence images of blebbing progenitor cells cultured in serum-free medium in confinement with GPI-RFP (membrane, red) and
Lifeact-GFP (cortex, green). Red arrow marks cellular bleb. Scale bar represents 20 μm.

(A') BF time-lapse images of non-motile progenitor cells related to culture conditions in (A). Blue asterisk indicates cell center. Scale bar represents 20 μm.

(B) TIRFM images of Lifeact-GFP in mesendodermal progenitors cultured on 2D substrates. Arrows mark filopodia (yellow) and lamellipodia (white). Scale bar
represents 10 μm.

(B') BF time-lapse images related to culture conditions in (B). Orange asterisks indicate exemplary migrating cells. Scale bar represents 20 μm.

(B'') Representative tracks of motile mesendodermal cells over 20 min (time lag 30 s, n = 15 cells).

(C) BF and fluorescence images of polarized progenitor cells with GPI-RFP (red) and Lifeact-GFP (green) cultured in 20% serum in confinement. Yellow arrow
points at the bleb-like protrusion front. Scale bar represents 20 μm.

(C') BF time-lapse images of motile stable-bleb cells cultured as described in (C). Red asterisk indicates cell movement. Scale bar represents 20 μm.

(D and D') Sketch and BF images summarizing different migration phenotypes of embryonic progenitor cells in vitro. All progenitor cells were obtained from
embryos at sphere stage and cultured on Fibronectin-coated substrates. Scale bar represents 20 μm.

See also Movie S1 and Extended Experimental Procedures.



environmental parameters. Early progenitor cells placed on 2D 
substrates displayed a characteristic blebbing morphology that 
can also be observed in early blastula stage embryos in vivo 
(Diz-Munoz et al., 2010). Notably, those blebbing cells failed to 
migrate irrespective of adhesive substrate coating with extracel- 
lular matrix (ECM) components, such as Laminin or Fibronectin 
(Figure 1A; Movie S1 available online). However, when progenitor
cells were induced to be of mesodermal or mesendodermal
origin and placed on Fibronectin-coated substrates, they
formed a characteristic mixture of lamellipodia and filopodia
(Figure 1B) and underwent collective migration with similar
speed (⟨v_cell⟩ = 3.8 ± 0.3 μm/min) to their movement in vivo
(Figure 1B'; Movie S1).

Strikingly, when adding serum to the culture medium, we
observed unexpected changes in progenitor cell architecture
with cells displaying a highly polarized cell morphology characterized
by a stable pear-like shape and a large spherical protrusion
front (Figure 1C). These cells, which we termed stable-bleb
cells, were non-motile when plated on adhesive 2D substrates,
but became highly migratory in confined environments (Figure 1C';
Movie S1). Thus, in the presence of serum, progenitors
transformed into a novel migration mode in vitro, clearly distinct
from previously described lamellipodia-, filopodia-, and bleb-based
migration types (Figure 1D).

Changes in Cortical Architecture Precede Stable-Bleb 
Cell Polarization 

To investigate the molecular basis underlying this transforma- 
tion, we first asked which serum components are involved. While 
adding serum growth factors, such as fibroblast growth factor 
(FGF), platelet-derived growth factor (PDGF), or epidermal 
growth factor (EGF), to the culture medium was not sufficient 
to induce stable-bleb cells (Figure S1A), we observed a fast and
robust transformation when lysophosphatidic acid (LPA) was
added (Figures 2A and S1B), a serum phospholipid known to
activate cortical contractility via the Rho/Rock pathway (Mills
and Moolenaar, 2003).

Polarization was stable in the presence of LPA, but cells 
switched rapidly back into their original blebbing behavior 
upon dilution of LPA from the culture medium, indicating that 
LPA-mediated transformation of progenitors into stable-bleb 
cells is a reversible process (Figure S1C). Furthermore, cell polarization
was observed in suspension at homogenous levels of
LPA (Movie S2) suggesting that cell transformation occurs in 
the absence of extrinsic gradients of biochemical or environ- 
mental polarity cues and arguing in favor of a cell intrinsic sto- 
chastic polarization mechanism induced by LPA. 

Surprisingly, transformation into stable-bleb cells could be 
triggered in progenitor cells from blastula and gastrula stages 
irrespective of their primary cell fate (Figure 2B) as determined 
by dissociating cells from Tg(mezzo:eGFP) transgenic embryos 
expressing eGFP in mesendodermal progenitor cells. We thus 
reasoned that the motile switch occurred independently of a 
cell-type-specific genetic module, but might rather be associ- 
ated with a generic polarization mechanism retrievable in these 
cells. 

Given that LPA is known to activate cortical contractility via the 
Rho/Rock pathway, we hypothesized that a mechanical polari- 
zation mechanism of the cell cortex could trigger the transforma- 
tion of progenitor cells into stable-bleb cells (Carvalho et al., 
2013). To test this hypothesis, we treated stable-bleb cells with 
the Rho kinase inhibitor Y-27632 or the myosin II inhibitor Blebbistatin
(Figure 2C). Treated cells lost their characteristic polarization,
supporting a critical role for Rho/Rock-mediated cortical
contractility in driving stable-bleb cell transformation. 

To address the effect of LPA stimulation on stable-bleb cell 
polarization, we monitored the distribution of cortical markers 
in the presence of different LPA concentrations. Myosin II rapidly 
redistributed to the cell cortex upon LPA stimulation (t_1/2 ≈ 20 s,
Figure 2D), and rising LPA concentrations were associated with 
increasing cortical accumulations of myosin II, eventually satu- 
rating at a threshold level at which progenitors transformed 
into stable-bleb cells (Figures 2E and 2F). Importantly, the accu- 
mulation of myosin II at the cortex was up to seven times larger 
than the accumulation of actin, indicating that myosin II-mediated
cortical contractility was specifically elevated upon LPA
stimulation (Figures 2G and S1D). Furthermore, bleb sizes in
dissociated progenitor cells closely scaled with the concentra- 
tions of LPA in the medium (Figures 2E and 2H), suggesting 
that LPA upregulates cortical contractility and enhances bleb 
expansion by increasing intracellular pressure. 
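
To make the reported recruitment half time concrete, the following is a minimal sketch of a sigmoid recruitment curve. The text states only that myosin II recruitment follows a sigmoid with a half time of about 20 s; the logistic form, the function name `sigmoid_recruitment`, and the steepness value `TAU` are assumptions chosen for illustration, not the authors' fit function or fitted parameters.

```python
import math


def sigmoid_recruitment(t, t_half, tau, plateau=1.0):
    """Logistic recruitment curve (assumed functional form).

    Returns plateau / (1 + exp(-(t - t_half) / tau)), so the curve reaches
    half of its plateau exactly at t = t_half.
    """
    return plateau / (1.0 + math.exp(-(t - t_half) / tau))


T_HALF = 20.0  # s, half time reported in the text
TAU = 5.0      # s, hypothetical steepness; not taken from the source

half = sigmoid_recruitment(T_HALF, T_HALF, TAU)  # value at the half time
late = sigmoid_recruitment(120.0, T_HALF, TAU)   # value long after stimulation
```

With this parameterization, the half time is a direct fit parameter, which is convenient when comparing recruitment kinetics across conditions.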

Interestingly, we observed that strong spatial confinement of 
progenitor cells in serum-free medium also led to elevated levels 
of myosin II accumulation at the cortex, larger blebs, and sto- 
chastic transformation into stable-bleb cells (Figures 2I-2K). In 
mesenchymal cells, this transformation was typically preceded 
by an increase in the formation of cellular blebs at the expense 
of lamellipodia and filopodia (Figure 2L). Together, these results 
imply that increasing levels of myosin II-mediated cortical
contractility by either biochemical or physical stimuli can trigger
a rapid and reversible amoeboid transformation of non-polarized
and mesenchymal progenitors (Figure 2M; Movie S3) that is
accompanied by extensive remodeling of cortical architecture
and myosin II localization in those cells. Notably, a similar
mesenchymal-to-amoeboid transition (MAT) has been observed
in various culture cell types upon compression (see Liu et al.
[2015] in this issue of Cell), suggesting that contractility-induced
MAT is a universal mechanism retrievable in various primary and
culture cell types.

A Contraction-Based Polarization Mechanism Drives 
Stable-Bleb Cell Transformation 

Based on these experimental results, we hypothesized that rapid 
changes in cortical architecture by increasing levels of cortical 
contractility mediate a cell intrinsic transformation into stable- 
bleb cells. To address this hypothesis, we developed a theoret- 
ical description of the polarization process by modeling cortex 
structures and associated cell shapes as a function of cortical 
contractility (see Extended Experimental Procedures). In this 
model, the cell cortex was treated as a thin spherical shell 
of compressible active gel (Callan-Jones and Voituriez, 2013; 
Hawkins et al., 2011) (Figure S2A). Assuming homogenous
bulk polymerization and depolymerization rates, we precluded 
a predetermined symmetry break of the cortical layer. Instead, 
we treated blebbing events in unpolarized progenitor cells as 
random spatial fluctuations in cortical contractility (Figure 3A), 
an assumption supported by measurements of the spatial extent 
(bleb size) and amplitude of cortical instabilities (myosin II enrich- 
ment) during cell blebbing at increasing LPA levels (Figures S2B 
and S2C). Notably, our model predicted that above a threshold 
level of cortical contractility these local instabilities would not 
relax but be amplified, eventually transforming an initially ho- 
mogenous cortex layer into a polarized steady state character- 
ized by a stable cortical density gradient along with retrograde 
cortical flows (Figures 3B and S2D). 

Indeed, time-lapse imaging of myosin II localization during cell 
polarization in LPA-stimulated progenitor cells confirmed a rapid
myosin II re-localization to the cortex and strongly increasing 
bleb sizes prior to polarization, consistent with elevated levels 
of cortical contractility fluctuations and an increasing imbalance 
in contractile forces between the bleb and cellular cortical 
network (Figure 3C; Movie S4). Stabilization of blebs was initi- 
ated by the appearance of directional cortical flows from the 
weak actomyosin-network assembling underneath the bleb to- 
ward the cell cortex, which resulted in a failure of bleb retraction 
and ultimately led to stable cell polarization (Movie S4). To test 
whether ectopically increased levels of contractility can trigger 
cell polarization, we generated a localized LPA diffusion gradient 
by a micropipette (Figure 3D). Progenitor cells in the presence of 
this LPA gradient rapidly transformed into stable-bleb cells with 
their contractile back oriented toward the source of LPA (Figures 
3E, 3F, and S2E). This suggests that external gradients of LPA 
can trigger directional cell polarization by inducing an asym- 
metric contraction of the cortical cytoskeleton in agreement 
with theoretical modeling. 

We next asked how stable-bleb cells maintain their polarity. 
According to our theoretical model, we expected that cell polar- 
ization in stable-bleb cells could potentially be maintained by a 
Figure 2. LPA Induces Cell Polarization to Stable-Bleb Cells In Vitro

(A) Percentage of polarized stable-bleb cells over time upon stimulation with 100 μM LPA.

(B) Percentage of polarized stable-bleb cells from blastula stage embryos (brown, n = 287) or 75% epiboly stage Tg(mezzo:eGFP) transgenic embryos with
mesendodermal (green, n = 183) and ectodermal progenitors (blue, n = 152) after 10 min in the presence of 100 μM LPA (+LPA) or DMEM-F12 culture medium
alone (Control [Ctrl] at blastula stage [n = 321], mesendodermal [n = 286], and ectodermal [n = 251]).



positive feedback loop between cortical contractility gradients
and the presence of cortical flows (Bois et al., 2011; Hawkins 
et al., 2011). In this positive feedback, cortical flows reinforce 
density gradients, in particular rearward localization of myosin 
II, and thereby reinforce contractility gradients that drive cortical 
flow toward the contractile region (Figure 3G). High resolution 
TIRF imaging of cortical actin and myosin II in polarized progen- 
itor cells confirmed the presence of stable cortical density gradi- 
ents toward the cell rear and revealed a low density actomyosin 
network in the spherical protrusion front (Figures 4A, 4B, S3A, 
and S3B). This sparse actomyosin meshwork was reminiscent 
of a bleb-like membrane blister but, unlike blebs, was accompa- 
nied by an unusually fast and continuous cortical actomyosin 
flow, referred to as cortical flow in the following, with maximal 
flow speeds up to 150 μm/min in the very cell front (Figures 4A,
4C, S3C, and S3D). Measurement of cortical flows along with 
cortical density profiles allowed for calculating cortex flux and 
cortex turnover rate (Figure 4D), indicating net polymerization in 
the spherical protrusion front and de-polymerization toward the 
rear. As a continuous rearward cortex flux requires permanent 
cortex turnover, stable-bleb cell polarization was rapidly lost 
upon treatment with the G-actin sequestering drug Latrunculin 
A or Jasplakinolide, a drug that interferes with cortex turnover 
(Figures 2C and B E; Movie S2). Moreover, treatment with the 
myosin II inhibitor Blebbistatin also reversed cell polarization 
(Figure 2C), indicating that cortical flow in combination with a 
gradient in contractility is critical for maintaining stable-bleb cell 
polarization over time. In contrast, inhibition of CDC42 (ML-141)
or PI3Kinase (L-294002), previously shown to be required for
mesendodermal progenitor cell migration in vivo (Montero
et al., 2003), did not affect stable-bleb cell polarization (Figure S2F),
supporting the concept that stable-bleb cell motility is
unrelated to actin-driven protrusion types such as lamellipodia
or filopodia. Collectively, our results support a simple mechanical
model of stochastic cell polarization based on the amplification of 
local fluctuations in cortical contractility and a positive feedback 
mechanism between contractility gradients and continuous 
cortical flows maintaining polarity in stable-bleb cells (Figure 3H). 
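
The flux and turnover calculation described in this paragraph can be sketched as follows. The cortex flux is the product of local cortical density and flow speed, and the turnover rate is taken as the spatial derivative of that flux, as stated in the Figure 4D legend. The profile values, the grid spacing, and the function names below are hypothetical and serve only to show the arithmetic; the authors fitted a polynomial to the flux before differentiating, whereas this sketch uses plain finite differences.

```python
def cortical_flux(density, velocity):
    """Pointwise flux J(x) = density(x) * velocity(x)."""
    return [rho * v for rho, v in zip(density, velocity)]


def derivative(values, dx):
    """Finite-difference derivative: central inside, one-sided at the ends."""
    n = len(values)
    result = []
    for i in range(n):
        lo, hi = max(i - 1, 0), min(i + 1, n - 1)
        result.append((values[hi] - values[lo]) / ((hi - lo) * dx))
    return result


# Hypothetical profiles sampled from the protrusion front (index 0) to the rear.
density = [0.1, 0.3, 0.6, 0.9, 1.0]            # normalized cortical density
velocity = [150.0, 100.0, 60.0, 30.0, 5.0]     # rearward flow speed, um/min

flux = cortical_flux(density, velocity)
turnover = derivative(flux, dx=1.0)            # > 0: net assembly, < 0: net disassembly
```

In this toy profile the derivative is positive near the front and negative toward the rear, which mirrors the qualitative conclusion in the text: net polymerization in the protrusion front and depolymerization toward the rear.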

To determine how the polarized cortical scaffold in stable-bleb 
cells connects to their characteristic shape, we expanded our 
theoretical model to calculate cell shapes based on deformations
of the cytoplasm due to compressive and shear stresses
at the cytoplasmic surface arising from cortical tension and 
flow-friction (Behrndt et al., 2012) (Figures S3E and S3F). The 
cytoplasm was modeled as a viscoelastic medium with residual 
elastic response at long times (Tinevez et al., 2009), which is the 
regime of interest for discussing steady-state shapes. While 
measured actomyosin density gradients serve as a readout of 
cortical contractility, cortical tension in general is related not 
only to contractility but also to viscous forces arising from flow 
(Mayer et al., 2010) and flow-induced filament alignment (Sal- 
breux et al., 2009). Using experimentally determined cortical 
density profiles we calculated shapes for two different cases: 
In the first case, we assumed an isotropic tension resulting in 
flattened, bulbous cell shapes not matching our experimental 
observations (Figure 4G, bottom). In the second case, we 
included viscous forces leading to tension reduction along the 
flow direction. Such tension anisotropy resulted in elongated 
pear shapes closely mirroring the observed shapes of stable- 
bleb cells (Figure 4G, top). In support of our theoretical predic- 
tions, we observed a preferential alignment of actin filaments 
perpendicular to the front-to-rear axis of stable-bleb cells, high- 
lighting the presence of anisotropic tension within the cortex 
(Figures S3I-S3K). Finally, we tested whether stable-bleb cell 
shapes depend on cortical viscosity and cell stiffness (Figures 
4H and S3G) and used osmotic shock experiments to alter intra- 
cellular pressure and, consequently, cell stiffness. Consistent 
with our theoretical predictions, we found that decreasing intra- 
cellular pressure resulted in more elongated cell shapes (Fig- 
ure S3H). Together, our theoretical considerations suggest that 
the characteristic pear-shape morphology of stable-bleb cells 
is generated by anisotropic tension in the cell cortex, which 
acts as a belt-like constriction unit applying compressive forces 
normal to the axis of polarization. 

Retrograde Cortical Flows Drive Stable-Bleb Cell 
Migration 

To study the migratory behavior of polarized stable-bleb cells, 
we monitored their motility in different environments (Figure 5A). 
Stable-bleb cells only weakly adhered to and failed to migrate 
on 2D adhesive substrates coated with ECM components, 
such as Laminin and Fibronectin. Fluorescence imaging of the 



(C) Percentage of polarized stable-bleb cells cultured in DMEM-F12 medium (Ctrl, n = 272), 100 μM LPA (+LPA, n = 219) or 30 μM LPA supplemented with 10 μM
Blebbistatin (+) (n = 167), 10 μM Blebbistatin (-) (n = 143), 10 μM Y-27632 (n = 149), 100 nM Latrunculin A (LatA, n = 134), and 500 nM Jasplakinolide (Jasplak)
(n = 83).

(D) Recruitment of myosin II to the cortex of progenitor cells over time after application of 10 μM LPA by a micropipette. Data (black, mean ± SEM) and sigmoid fit
function (red) with half time t_1/2 = 20 s (n = 30).

(E) Myosin II localization in progenitor cells cultured in DMEM-F12 medium alone (Ctrl), 1 μM or 10 μM LPA. Arrows mark blebs (yellow) and the cortex-depleted
protrusion front of stable-bleb cells (red).

(F) Boxplot of relative cortical myosin II fluorescence intensities for increasing LPA levels in unpolarized progenitor cells (each n = 120).

(G) Cortical enrichment of Myl12.1-eGFP (n = 60), Lifeact-GFP (n = 60), and GPI-RFP (n = 33) in non-polarized progenitor cells treated with 10 μM LPA. Cortical
fluorescence intensity values I_LPA are normalized to reference values without LPA stimulation I_ctrl.

(H) Relative bleb sizes for increasing LPA levels (each n = 120).

(I and J) Relative bleb sizes (I) and relative cortical myosin II fluorescence intensities (J) under non-confined and confined conditions.

(K) Exemplary fluorescence images of non-polarized and polarized cells expressing Myl12.1 in non-confined and confined conditions.

(L) BF images of mesodermal progenitors cultured in non-confined and confined environments (top) and quantification of cellular protrusions (bottom).

(M) Time-lapse BF and fluorescence images of Myl12.1-eGFP localization in mesodermal progenitor cells cultured on a 2D substrate coated with Fibronectin
upon LPA stimulation. n.s., not significant; ***p < 0.001. Boxplots show single cell data points (black) and median (red). All cells were obtained from embryos at
sphere stage, except (B), and cultured in suspension on a passivated substrate, except (L and M). n, number of cells. All scale bars represent 20 μm.

See also Figure S1, Movies S2 and S3, and Extended Experimental Procedures.
Figure 3. Cortical Contractility Fluctuations Drive Polarization to Stable-Bleb Cells

(A) Sketch of polarization from a non-polarized state (left) to a polarized stable-bleb state (right). The cell cortex is shown in green and highly contractile cortex 
areas in red. Red arrows denote a cortical force imbalance due to local fluctuations in cortical contractility. Blebs induce spatial inhomogeneities in cortical 
contractility of increasing strength for larger bleb sizes. Green arrows indicate cortical flows. 

(B) Stability diagram illustrating the switch from a non-polarized to a polarized state. The curve represents the values of cortical contractility ζ, depending on the
density fluctuation mode number l. The mode number, l, is inversely proportional to the wavelength (in units of initial cell radius) of the fluctuation.

(C) Myosin II localization in a transforming progenitor cell cultured in suspension with 30 μM LPA. Scale bar represents 20 μm.

(D) Sketch of the local LPA application experiment. 

(E) BF time-lapse images and Myl12.1-eGFP localization during progenitor cell polarization in a local 100 μM LPA diffusion gradient set by a micropipette (red
arrow). 

(F) Histogram of measured cell polarization orientations (n = 32). 

(G and H) Schematic illustration of the positive feedback loop between cortical actomyosin density gradients and rearward cortical flows for the maintenance of 
stable-bleb cell polarity. Local fluctuations in cortical contractility induce a flow toward the contractile region along with mass transport of actin and myosin (red 
dots) that reinforces the initial instability (G). To maintain a continuous cortical flow in polarized cells, actin turnover (with rate constants k_p and k_d) and diffusion of
free actin and myosin (with diffusion constant D) are required (H). Polarization is lost upon pharmacological inhibition of myosin II contractility (by Blebbistatin), actin
polymerization (by Latrunculin-A) and actin de-polymerization (by Jasplakinolide). 

See also Figure S2, Movies S2 and S4, and Extended Experimental Procedures. 
Figure 4. Cortical Architecture Determines Cell Shape of Stable-Bleb Cells In Vitro

(A) TIRFM image showing Lifeact-GFP (left) and myosin II localization (right) in isolated stable-bleb cells with corresponding kymograph data along yellow lines. 
Orange dotted line indicates the cortical flow profile. 

(B) Average actin and myosin II density profiles obtained from culture conditions in (A) (n = 30). 

(C) Average 2D cortical flow map in the spherical protrusion front of stable-bleb cells (n = 3). The red dashed square highlights the analyzed cell area. 

(D) Cortical flux data (black, mean ± SEM) calculated from average cortical density (B) and average cortical flow profiles (Figure S3C) with polynomial fit (gray) and 
cortex turnover rate (blue) calculated as the derivative of the cortical flux (n = 30). 

(E) BF and fluorescence cross-sectional image of an isolated stable-bleb cell expressing GPI-RFP. 

(F) Average (red) and single cell shapes (gray) of polarized stable-bleb cells (n = 30). 

(G) Theoretical predictions on polarized cell shapes generated by cortical tension components parallel T_∥ and perpendicular T_⊥ to the polarization axis. Top:
anisotropic tension (T_∥ < T_⊥). Bottom: isotropic tension (T_∥ = T_⊥). Color bar represents normalized cortical density distributions. Magenta arrows indicate the
direction of cortical flow v(θ) in polar direction. Parameter values are provided in the Extended Experimental Procedures.

(H) Phase diagram of cell shapes with cell aspect ratio A as a function of tension anisotropy. Elongated, pear shapes are obtained for higher anisotropy. All cells 
were cultured on a PEG-coated substrate either with (A-D) or without confinement (E and F). Scale bars represent 10 μm.

See also Figure S3 and Extended Experimental Procedures. 



subcellular distribution of molecules associated with coupling 
the cortex to the surrounding substrate, such as N-Cadherin, 
Ezrin, and Integrin revealed a highly asymmetric accumulation 
toward the cell rear, while the bleb-like protrusion front was 
largely devoid of those molecules (Figures 5B and 5C). This indi- 
cates only weak adhesive forces in the front that are insufficient 
for forward locomotion on 2D substrates. To increase frictional 
coupling of migrating cells, we thus created a planar 2D confinement 
by placing cells between a coated glass substrate and an 
agarose layer. We found that stable-bleb cells plated on Fibronectin-coated 
substrates under confinement displayed exceptionally 
fast and directional movements with average cell speed 
<v_cell> = 17.1 ± 0.6 μm/min (n = 67 cells) and persistence length 
of approximately four cell diameters (Figures 5E and 5F), best 
described by a persistent random walk model over long time- 
scales (Figure 5F). 
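
For orientation, the mean-square displacement (MSD) of a persistent random walk can be sketched with Fürth's formula, MSD(t) = 2 S² P [t - P (1 - exp(-t/P))], where S is the cell speed and P the persistence time. This is a commonly used form and is assumed here; the exact fit function used for Figure 5F is not stated in this section. The function names are hypothetical, the speed is taken from the text, and the persistence time of about 10 min from the Figure 5F legend.

```python
import math


def prw_msd(t, speed, persistence_time):
    """MSD of a persistent random walk (Fuerth's formula, assumed form)."""
    p = persistence_time
    # t + p * expm1(-t / p) equals t - p * (1 - exp(-t / p)),
    # but stays accurate when t is much smaller than p.
    return 2.0 * speed ** 2 * p * (t + p * math.expm1(-t / p))


def loglog_slope(t, speed, persistence_time, rel_step=1e-3):
    """Local scaling exponent d ln(MSD) / d ln(t), estimated numerically."""
    t_lo = t * (1.0 - rel_step)
    t_hi = t * (1.0 + rel_step)
    msd_lo = prw_msd(t_lo, speed, persistence_time)
    msd_hi = prw_msd(t_hi, speed, persistence_time)
    return (math.log(msd_hi) - math.log(msd_lo)) / (math.log(t_hi) - math.log(t_lo))


speed = 17.1             # um/min, average cell speed reported in the text
persistence_time = 10.0  # min, approximate value from the Figure 5F legend

# Simple estimate of the persistence length as speed times persistence time.
persistence_length = speed * persistence_time

short_time_exponent = loglog_slope(0.01, speed, persistence_time)   # directed regime
long_time_exponent = loglog_slope(1000.0, speed, persistence_time)  # random regime
```

The exponent approaches 2 (directed migration, ~t²) well below the persistence time and 1 (random migration, ~t) well above it. The product S × P is of the same order as the persistence length of roughly 150 μm quoted in the legend.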
Figure 5. Migration Characteristics of Stable-Bleb Cells In Vitro 

(A) Illustration of progenitor cells cultured under serum/LPA-free (I, II) or serum/LPA-containing (III, IV) conditions with (II, IV) or without (I, III) confinement. 

(A') Table summarizing cell migration characteristics for different culture conditions. 

(B-D) N-Cadherin-GFP (B), Ezrin-YFP (C), and Integrinα5-Integrinβ1-BiFc-Venus (D) localization in polarized stable-bleb cells. 

(E) Representative tracks of migrating stable-bleb cells. 

(F) Mean-square displacement (MSD) analysis data (blue, n = 67) with fit to a persistent random walk model (gray) and compared to random (~t, orange) and 
directed migration (~t², red). Black dashed line highlights random migration on timescales above the persistence time P_t ~ 10 min (persistence length ≈ 150 μm). 
(F') Sketch of MSD time point analysis. Arrows denote variations in the direction of movement affecting persistence of cell movement. 

(G) Average instantaneous stable-bleb cell migration speeds in confinement between glass-agarose or glass-glass assays for varying substrate coatings as 
indicated. 

(H-J) Localization of Paxillin-GFP for stable-bleb cells on BSA- (H), PEG- (I), and Fibronectin-coated (J) glass substrates in confinement. 

(K) Paxillin-GFP localization for mesodermal progenitors cultured on 2D planar substrates coated with Fibronectin. n denotes number of cells. White dashed lines 
in (H)-(K) outline cell contact areas. All scale bars represent 10 μm. 

See also Extended Experimental Procedures. 



To test the role of specific versus unspecific adhesion, we 
monitored stable-bleb cell migration in various adhesive and 
non-adhesive environments using a modified assay that consists 
of two parallel glass slides coated with either Fibronectin, poly- 
L-lysine (PLL), BSA, or polyethyleneglycol (PEG) separated by 
15-μm beads. Surprisingly, stable-bleb cells were highly motile 
under all conditions with their migration efficacy increasing in 
non-adhesive environments (Figure 5G). To monitor the forma- 
tion of adhesive foci in stable-bleb cells during migration, we 
further imaged Paxillin-GFP, an adaptor protein known to be 
involved in Integrin-mediated force transduction and focal adhesion 
assembly in motile cells (Turner, 2000). When stable-bleb 
cells were placed between Fibronectin-coated surfaces, Paxillin-GFP 
localized in distinct spots at cell-substrate contacts 
similar to mesenchymal mesodermal cells cultured on 2D planar 
Fibronectin-coated substrates, while Paxillin-GFP appeared ho- 
mogeneously distributed on non-adhesive BSA and PEG-coated 
surfaces (Figures 5H-5K). This suggests that stable-bleb cells 
are able to generate both Integrin-dependent and independent 
surface attachments during migration. To further investigate 
the role of cell-substrate versus cell-cell adhesion, we also 
reduced E-Cadherin expression in stable-bleb cells by injecting 
antisense morpholino oligonucleotides (MOs) and found that 
E-Cadherin is dispensable for cell migration in vitro irrespective 
Figure 6. Frictional Coupling of Retrograde 
Cortical Flows Drives Stable-Bleb Migration 

(A and D) TIRFM images of (A) Myl12.1-eGFP and 
(D) Lifeact-GFP localization in motile stable-bleb 
cells. White dashed line marks the cell-to-substrate 
contact area. Scale bars represent 10 μm. 

(B and E) Kymograph data along the direction of 
movement indicated by yellow lines in (A) and (D). 
Green dashed lines outline cell front and rear 
advancement. White dashed line highlights the 
rearward flow of a representative fluorescent 
patch. 

(C and F) Retrograde cortical flow velocity data in 
single stable-bleb cells obtained from kymograph 
data in (B) and (E) with v_cell = 24.1 μm/min (C) and 
v_cell = 18.2 μm/min (F). 

(G and H) Average (black) and single cell local flow 
velocity data (red) of migrating stable-bleb cells (H) 
(n = 15) with average cell speed v_cell = 19.2 ± 
2.4 μm/min and of non-motile stable-bleb cells (G) 
(n = 30) cultured in confinement under agarose on a 
non-adhesive PEG-coated substrate (v_cell = 0). 
Shaded areas denote regions with zero (v = 0, light 
gray) or anterograde (v < 0, dark gray) cortical flow. 
Cortical flows are shown in the laboratory frame 
of reference. All cells were cultured in the presence 
of 30 μM LPA in confinement under agarose on 
Fibronectin-coated substrates, except for the data in 
(G). n, number of cells. Error bars represent 
mean ± SEM. 

See also Figure S4, Movie S5, and Extended 
Experimental Procedures. 



of the substrate coating (Figure S4F). Together, these data 
support that stable-bleb cells are able to generate both specific 
and unspecific traction forces during migration, highlighting their 
versatile migration competence in various adhesive and non- 
adhesive environments. 

To understand this efficient and persistent motile behavior, we 
asked how stable-bleb cells move in 2D confined environments. 
Depletion of cortical elements in the cell front (Figure 4A) and 
the diffuse zone of actin net polymerization in the spherical front 
(Figure 4D) exclude a classical polymerization-driven locomotion 
strategy, where polymerizing actin filaments push the leading 
edge plasma membrane forward. However, the presence of fast 
retrograde cortical flows led us to hypothesize that coupling of 
the flowing actin network to the surrounding substrate powers 
stable-bleb cell migration under 2D confinement. To test this 
hypothesis, we performed high-resolution live cell imaging of 
retrograde cortical flows in stable-bleb cells in confinement under 
agarose. Motile stable-bleb cells on adhesive Fibronectin-coated 
substrates showed a rapid decay of flow velocities toward the cell 
rear with a pronounced zone of zero flow speed (Figures 6A-6F 
and 6H; Movie S5). In contrast, non-motile stable-bleb cells that 
were placed on passivated PEG-coated substrates revealed 
considerably different retrograde flow profiles with similar 
maximal flow velocities to motile cells in the cell front, but a slower 
decay of flow speeds to the cell rear (Figures 6G and S3D). Using 
a 1D fluid description of the cortex, we estimated from 
those cortical flow profiles a higher frictional coefficient on adhesive 
in comparison to non-adhesive surfaces (adhesive: 2.8 × 10⁸ 
Pa × s/m versus non-adhesive: 1 × 10⁸ Pa × s/m; Figures S4A-S4C). 
Furthermore, osmotic shock experiments revealed that 
cortical flows are progressively decreased for lower levels of intra- 
cellular pressure and that these changes in cortical flow velocity 
strongly correlate with cell migration speed (Figures S4D and 
S4E). These findings support the notion that coupling of the cortex 
to the surrounding substrate leads to a reduction of cortical 
flow in motile cells, providing the necessary forward traction force 
required for cell movement. 
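
To illustrate how a higher friction coefficient relates to a faster spatial decay of cortical flow, the sketch below assumes the textbook result for a 1D viscous fluid with substrate friction, in which flows decay over a hydrodynamic length equal to the square root of viscosity divided by friction. This relation is an assumption made for illustration and is not necessarily the exact model used in the Extended Experimental Procedures. The function names are hypothetical; the two friction coefficients are the values quoted in the text.

```python
import math

FRICTION_ADHESIVE = 2.8e8      # Pa*s/m, value quoted in the text
FRICTION_NON_ADHESIVE = 1.0e8  # Pa*s/m, value quoted in the text


def hydrodynamic_length(viscosity_2d, friction):
    """Decay length sqrt(viscosity / friction) of flow in a viscous layer.

    viscosity_2d is a 2D (thickness-integrated) viscosity in Pa*s*m;
    friction is in Pa*s/m, so the result is in meters.
    """
    if viscosity_2d <= 0 or friction <= 0:
        raise ValueError("viscosity and friction must be positive")
    return math.sqrt(viscosity_2d / friction)


def decay_length_ratio(friction_a, friction_b):
    """Ratio length_a / length_b at equal viscosity (viscosity cancels out)."""
    return math.sqrt(friction_b / friction_a)


# The ratio does not depend on the (unknown) cortical viscosity.
ratio = decay_length_ratio(FRICTION_ADHESIVE, FRICTION_NON_ADHESIVE)
```

Under this assumption the flow decay length on the adhesive substrate would be about 0.6 times that on the non-adhesive substrate, in qualitative agreement with the faster decay of flow velocities described for motile cells on Fibronectin.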

Evidence for Stable-Bleb Cell Migration In Vivo 

To study the motile switch and migration characteristics of sta- 
ble-bleb cells in vivo, we first aimed to ectopically transform cells 
in the early zebrafish blastula by elevating their cortical contrac- 
tility. To this end, we expressed a constitutively active version of 
RhoA (caRhoA) (Takesono et al., 2012) in single progenitor cells 
and performed two-photon imaging of their morphology 
and migration properties in the gastrulating embryo. CaRhoA 
expressing cells in vivo showed high blebbing rates and 
pronounced cortical myosin accumulations similar to LPA-stim- 
ulated progenitor cells in vitro (Figure 7A). Moreover, caRhoA- 
expressing cells frequently transformed into motile cells closely 
resembling stable-bleb cells in vitro displaying a pear-shape, 
pronounced cortical actin and myosin II gradients and rearward 
cortical flows of similar magnitude as found in vitro (v_cortex 
~30 μm/min; Figures 7B and S5A-S5C; Movie S6). To further 
test whether caRhoA expressing progenitor cells in vivo indeed 
resemble stable-bleb cells in vitro, we transferred them into 
Figure 7. Identification and Characterization of Stable-Bleb Cells In Vivo 

(A) Myl12.1-eGFP (green) and GPI-RFP (magenta) localization in non-polarized embryonic cells observed in a Tg(actβ1:myl12.1eGFP) transgenic embryo at 
sphere stage expressing caRhoA and GPI-RFP in a mosaic pattern. Asterisks denote cell nuclei in the vicinity of a caRhoA expressing cell highlighting cells with 
reduced cortical accumulation of myosin II compared to caRhoA expressing cells. Scale bars, 10 μm. 

(B) Fluorescence time-lapse series of a stable-bleb like cell in a Tg(actβ1:myl12.1eGFP) transgenic embryo at sphere stage expressing caRhoA and GPI-RFP in a 
mosaic pattern. Scale bars represent 10 μm. 

(C) Illustration of the transplantation experiment in (D). 

(D) Transplantation of cells from a Tg(actβ1:myl12.1eGFP) transgenic donor to a wild-type host embryo at sphere stage. A stable-bleb-like cell emerging from the 
transplanted tissue is highlighted in the red box. Scale bar represents 100 μm. 



culture and found that they exhibited a morphology and migration 
behavior indistinguishable from LPA-induced stable-bleb 
cells in vitro (Figures S4G-S4J). This suggests that upregulation 
of cortical contractility through expression of caRhoA is suffi- 
cient to transform progenitor cells into motile stable-bleb cells 
in vivo. 

To unravel the role of cell-cell adhesion for adhesive force 
coupling during stable-bleb cell motility in vivo, we downregu- 
lated E-Cadherin expression in caRhoA expressing embryos 
by injecting specific E-Cadherin MOs. Upon transplantation of 
those cells into a wild-type host, we found that E-Cadherin 
was dispensable for stable-bleb migration in vivo and that its 
knockdown even increased cell dispersal compared to caRhoA-expressing 
cells with normal levels of E-Cadherin (Figures 7H 
and S5D-S5F). 

We next sought to address if stable-bleb cells also emerge in 
the endogenous context of embryonic development. Time- 
lapse movies of cell movements within the gastrulating embryo 
occasionally revealed single fast migrating amoeboid cells 
reminiscent of stable-bleb cells (see Movie S7). However, since 
it was impossible to predict where and when in the gastrulating 
embryo these cells appear, we reasoned that locally increasing 
cortical contractility within the gastrulating embryo might 
trigger stable-bleb cell transformation. Wounding sites have 
previously been shown to display elevated levels of actomy- 
osin contractility (Sonnemann and Bement, 2011). To test if 
stable-bleb cells emerge at local wounding sites within the 
gastrulating embryo, we transplanted fluorescently labeled 
donor cells into an unlabeled host embryo at sphere stage 
(4 hr post fertilization [hpf]) (Figures 7C and 7D). Using 
Tg(actβ1:myl12.1eGFP) donor embryos, elevated cortical accumulations 
of myosin II were detected around the wounding 
area (Movie S8) and, interestingly, labeled donor cells stochas- 
tically transformed into stable-bleb like cells with a highly 
polarized cortical architecture and fast retrograde cortical 
flows similar to their morphology in vitro (Figure 7E; Movie 
S8). Transformed cells migrated away from the region of transplantation 
with speed (v = 8.5 ± 0.3 μm/min) and directionality 
much higher than any other cells yet described in the zebrafish 
embryo (Figures 7F and 7G). Together, this indicates that sta- 
ble-bleb motility can be induced at places of high contractility 
within the gastrulating embryo, suggesting a mechanism for 
rapid cell extrusion from highly contractile regions within the 
embryo. To further test this assumption, we prepared tissue 
explants from early blastula stage embryos and cultured 
them in LPA-containing medium. We found that marginal cells 
in tissue explants not only rapidly transformed into stable-bleb 
cells, as expected for LPA-treated progenitors, but also 
frequently extruded from the explant surface (Figure 7I). This 
suggests that stable-bleb cells can overcome the surface ten- 
sion of those tissue explants and freely disperse into the 
environment, supporting our notion of stable-bleb cell transfor- 
mation as an efficient cell extrusion mechanism from contrac- 
tile tissues. 

DISCUSSION 

Here, we describe a stochastic amoeboid motility switch of early 
zebrafish progenitor cells into stable-bleb cells. Cell morphology 
and migration characteristics of stable-bleb cells show unique 
features in comparison to other motile cell types described 
before. The invariant roundish cortex-depleted front end of 
stable-bleb cells is clearly different from actin rich protrusions, 
such as lamellipodia and filopodia typically found in mesen- 
chymal cells and also distinct from contractility-driven bleb pro- 
trusions that periodically appear in blebbing cells. We show that 
stable polarization of transformed progenitor cells is maintained 
by a positive feedback loop between cortical density gradients 
and continuous cortical flows, yielding a robust steady-state 
configuration with a highly polarized cortical architecture. Inter- 
estingly, we found that increasing cortical contractility is suffi- 
cient to switch progenitor cells from a blebbing or mesenchymal 
mode to a stably polarized state above a critical threshold level 
of contractility, and that this transformation is reversible upon 
lowering myosin II activity below a critical threshold level. This 
indicates that the level of cortical contractility determines 
whether cells migrate by forming either transient protrusions, 
such as blebs and lamellipodia, or one stably polarized front 
thereby increasing their migration persistence and efficacy. 
Supported by theoretical modeling of progenitor transformation 
into stable-bleb cells, it further suggests that this process does 
not require an elaborate genetic program, but rather occurs as a 
simple mechanical switch triggered by increased cortical 
contractility. This generic polarization module can likely be acti- 
vated via diverse cellular pathways such as microtubule disas- 
sembly-dependent activation of the Rho/ROCK pathway (Niggli, 
2003) and could thus be a potentially effective polarization 
mechanism also functional in other cell types. Notably, findings 
in various culture cell lines suggest that high spatial confinement 
combined with low substrate adhesion is sufficient to trigger an 



(D' and D'') Magnified views on transplanted stable-bleb-like cells observed in (D). Dashed white line marks cell borders. Red arrow indicates the spherical 
protrusion front. Scale bar represents 20 μm. 

(E) Myl12.1-eGFP localization and cortical flow in a stable-bleb cell obtained from transplantation of Tg(actβ1:myl12.1eGFP) transgenic donor cells into a 
wild-type host. Red arrows point to the direction of cortical flow. Dashed yellow lines outline cell borders. Scale bar represents 25 μm. 

(F and G) Representative tracks (F) and instantaneous migration speeds (G) of stable-bleb cells in vivo (n = 15 cells) with average velocity (<v_cell> = 8.5 ± 
0.3 μm/min; yellow line). 

(H) Time-lapse fluorescence images of a cell transplantation between Myl12.1-eGFP donor embryos expressing caRhoA (Ctrl) or caRhoA+E-Cadherin morpholino 
(Ecad-MO) and a wild-type host at dome stage. Dashed white lines outline the embryo margin. Scale bar represents 100 μm. 

(H') Schematic illustration of the transplantation experiment in (H). 

(I) Sketch of tissue explant preparation and culturing. 

(I') BF time-lapse images of a blastoderm tissue explant obtained from wild-type embryos at sphere stage and cultured with 30 μM LPA. Scale bar represents 
100 μm. 

See also Figure S5, Movies S6, S7, and S8, and Extended Experimental Procedures. 
amoeboid cell transformation strikingly similar to stable-bleb 
cells (see Liu et al., 2015). Together, these data suggest that 
the regulation of cortical contractility might serve as a universal 
switch for transforming cells into an amoeboid cell migration 
mode characterized by fast and highly directional migration in 
3D confinement. 

Spatially confined environments promote stable-bleb cell 
migration by providing compressive forces that increase frictional 
coupling required for efficient locomotion. Moreover, retrograde 
cortical flows are sufficient to drive stable-bleb cell migration in 
both adhesive and non-adhesive confined environments. Similar 
flow-friction mechanisms have recently been proposed to drive 
locomotion of various cell types in confinement (Hawkins et al., 
2009; Liu et al., 2015), suggesting that force-coupling via retro- 
grade cortical flows might constitute a general mechanism of 
cell movement especially in low adhesive 3D environments. 

During gastrulation cell motility needs to be precisely 
controlled and spatio-temporally regulated. Recent studies 
have pointed at important functions of small Rho GTPases 
in cell migration during zebrafish gastrulation (Matthews et al., 
2008; Kardash et al., 2010). Our observation that LPA, a RhoA- 
activating signal ubiquitously expressed during zebrafish 
gastrulation (Lee et al., 2008; Yukiura et al., 2011), can transform 
progenitor cells into stable-bleb cells, reveals a yet unrecognized 
function of RhoA in the control of cell migration during gastrula- 
tion and points at LPA as one potential regulator determining sta- 
ble-bleb cell induction during gastrulation. However, given the 
large number of known signals with RhoA activating capacity, 
a systematic functional analysis of these signals will be needed 
to dissect the upstream regulation of stable-bleb cell transfor- 
mation during gastrulation. 

As to the function of stable-bleb cells in gastrulation, we 
observed stable-bleb cell transformation at areas of high cell 
contractility, such as wounding sites, followed by rapid dispersal 
of those cells. This suggests that stable-bleb transformation 
serves as a mechanism for cell extrusion from areas of increased 
contractility in the embryo. This might help in equilibrating 
contractility-driven cell density variations, but might also be 
used as a means of long-range communication where extruding 
stable-bleb cells rapidly transmit signals from areas of high 
contractility to the remainder of the embryo. Our observation of 
stable-bleb cell extrusion from tissue explants along with obser- 
vations of a similar migration phenotype in various cancer cell 
lines (see Liu et al., 2015) also points at the intriguing possibility 
that stable-bleb cells might be involved in other contexts such as 
cancer metastasis where cancer cells disseminate from the pri- 
mary tumor into unaffected tissues. Thus, stable-bleb cell trans- 
formation identifies a yet unrecognized mechanism for rapid and 
long-range cell dispersal in vertebrates. 

EXPERIMENTAL PROCEDURES 
Zebrafish Maintenance 

Zebrafish (Danio rerio) were maintained as previously described (Westerfield, 
2007). Embryos were kept in E3 medium at 25°C-31°C prior to experiments 
and staged based on morphological criteria (Kimmel et al., 1995) and hours 
postfertilization (hpf). Wild-type embryos were obtained from the Tup Longfin 
(TL) background unless indicated otherwise. See Extended Experimental 
Procedures for transgenic fish lines and mRNA/morpholino injections. 



Cell Culture 

To culture primary germ layer progenitor cells, embryos were manually de- 
chorionated in E3 buffer at sphere stage (4 hpf) or 75% epiboly stage (8 hpf). 
Five to ten embryos were transferred to DMEM-F12 culture medium (Sigma) 
and mechanically dissociated by manual tapping followed by centrifugation 
at 200 × g for 2 min. To induce stable-bleb polarization, 100 μM LPA was 
added during centrifugation unless indicated otherwise. See Extended Exper- 
imental Procedures for more details. 

Reagents and Inhibitor Treatments 

Fetal BSA (GIBCO) and 1-Oleoyl lysophosphatidic acid (LPA, Tocris Biosci- 
ence) were used at the indicated concentrations. Fibroblast growth factor 
mFGF-8b and rhFGF-3 (R&D Bioscience), platelet-derived growth factor 
PDGF-AA (PeproTech), and epidermal growth factor hEGF (Sigma) were 
used at a final concentration of 100 ng/μl. Pharmacological inhibitors were 
used at the following concentrations: 10 active (-) or inactive Blebbistatin 
(+) (Tocris Bioscience), 100 nM Latrunculin-A (Sigma), 100 Y-27632 (Tocris 
Bioscience), 500 nM Jasplakinolide (Tocris Bioscience), 20 μM ML-141 
(Sigma), and 30 LY-294002 (Sigma). The percentage of polarized cells 
was measured after 15 min (Blebbistatin, Latrunculin-A, and Jasplakinolide) 
and 30-60 min (Y-27632, ML-141, LY-294002) of exposure. 

Data Statistics 

Data are presented as mean ± SEM unless indicated otherwise. Statistical 
tests were performed by using the Wilcoxon rank sum test in MATLAB. 
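
As an illustration of the two quantities named above, the following sketch computes a mean ± SEM and a two-sided Wilcoxon rank sum test. It is written in plain Python rather than MATLAB, uses the large-sample normal approximation without tie or continuity correction, and is not a reproduction of the authors' analysis; all function names and input values are hypothetical.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, SEM), with SEM = sample standard deviation / sqrt(n)."""
    if len(values) < 2:
        raise ValueError("need at least two values")
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(len(values))


def midranks(values):
    """Ranks starting at 1; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    return ranks


def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank sum test via the normal approximation.

    Returns (W, z, p), where W is the sum of the ranks of sample x.
    """
    n1, n2 = len(x), len(y)
    if n1 == 0 or n2 == 0:
        raise ValueError("both samples must be non-empty")
    ranks = midranks(list(x) + list(y))
    w = sum(ranks[:n1])
    mean_w = n1 * (n1 + n2 + 1) / 2.0
    sd_w = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (w - mean_w) / sd_w
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided tail probability
    return w, z, p


# Hypothetical measurements for two conditions (not data from the paper).
control = [1.0, 2.0, 3.0, 4.0, 5.0]
treated = [6.0, 7.0, 8.0, 9.0, 10.0]

mean_value, sem_value = mean_sem(control)
w_stat, z_score, p_value = rank_sum_test(control, treated)
```

For small samples such as this toy input, an exact test would normally be preferred over the normal approximation.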

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, five 
figures, and eight movies and can be found with this article online at 
http://dx.doi.org/10.1016/j.cell.2015.01.008. 
In Brief 

WHIM syndrome is an inherited 
immunodeficiency caused by overactivity 
of CXCR4, a receptor controlling 
production and distribution of leukocytes 
in bone marrow and blood. We identified 
a WHIM patient cured by chromothripsis 
(chromosome shattering) that fortuitously 
deleted the abnormal copy of the CXCR4 
gene in a single hematopoietic stem cell, 
which then took over the bone marrow 
and restored normal immune function. 
This experiment of nature suggests that 
partial CXCR4 inactivation might enhance 
engraftment of bone marrow in patients 
requiring transplantation. 

Highlights 

• CXCR4 haploinsufficiency may promote HSC engraftment 

• Chromothriptic deletions result in functional cure of WHIM 
syndrome 

• Clinical symptoms in WHIM syndrome are dependent on 
myeloid deficits 

• Myeloid-derived cells are critical for control of HPV infection 
SUMMARY 

Chromothripsis is a catastrophic cellular event 
recently described in cancer in which chromosomes 
undergo massive deletion and rearrangement. Here, 
we report a case in which chromothripsis spontane- 
ously cured a patient with WHIM syndrome, an 
autosomal dominant combined immunodeficiency 
disease caused by gain-of-function mutation of the 
chemokine receptor CXCR4. In this patient, deletion 
of the disease allele, CXCR4 R334X , as well as 163 
other genes from one copy of chromosome 2 
occurred in a hematopoietic stem cell (HSC) that re- 
populated the myeloid but not the lymphoid lineage. 
In competitive mouse bone marrow (BM) transplan- 
tation experiments, Cxcr4 haploinsufficiency was 
sufficient to confer a strong long-term engraftment 
advantage of donor BM over BM from either wild- 
type or WHIM syndrome model mice, suggesting a 
potential mechanism for the patient’s cure. Our find- 
ings suggest that partial inactivation of CXCR4 may 
have general utility as a strategy to promote HSC 
engraftment in transplantation. 

INTRODUCTION 

WHIM syndrome is an autosomal dominant combined primary 
immunodeficiency disease caused by mutations in the chemo- 
kine receptor CXCR4 (Hernandez et al., 2003). The term 
“WHIM” is an acronym for the main manifestations of the disease: 
warts, hypogammaglobulinemia, recurrent infections, 
and myelokathexis; myelokathexis refers to impaired egress of 
mature neutrophils from bone marrow causing neutropenia 
(Wetzler et al., 1990; Zuelzer, 1964). Most patients with WHIM 
syndrome are actually panleukopenic, with severely reduced 
peripheral blood B cells but less severe reductions in peripheral 
blood T cells and monocytes (McDermott et al., 2011). The 
signature pathogen in WHIM syndrome is human papillomavirus 
(HPV), which causes warts that cannot be controlled with stan- 
dard medical treatment and may progress to cancer (Al Ustwani 
et al., 2014; Beaussant Cohen et al., 2012; Dotta et al., 2011; 
Kawai and Malech, 2009). Recurrent bacterial infections also 
occur, mainly in the sinopulmonary tract, oral cavity, ear, skin, 
and soft tissue, where chronic complications may arise, espe- 
cially bronchiectasis and hearing loss. Prophylactic antibiotics, 
IVIg and G-CSF are often used to reduce the incidence of infec- 
tions; however, their precise efficacy has not been established 
(Al Ustwani et al., 2014; Beaussant Cohen et al., 2012; Dotta 
et al., 2011; Kawai and Malech, 2009). In contrast, safety and 
preliminary evidence of clinical efficacy have recently been reported 
from Phase I studies of the specific CXCR4 antagonist 
plerixafor (Mozobil, AMD3100) (Dale et al., 2011; McDermott 
et al., 2011, 2014). Spontaneous remission or cure of WHIM syndrome 
has not been previously reported. 

WHIM mutations of CXCR4 increase signaling because they 
disrupt negative regulatory elements in the carboxy-terminus, 
thereby exaggerating the normal hematopoietic functions of 
the receptor (Haribabu et al., 1997; Signoret et al., 1998; Venka- 
tesan et al., 2003). CXCR4 is normally expressed by most leuko- 
cytes and has one ligand, CXCL12 (Bachelerie et al., 2014; Bleul 
et al., 1996), which is constitutively expressed at high levels by 
stromal cells in the bone marrow and normally mediates HSC 
retention in bone marrow niches (Broxmeyer et al., 2003b; Broxmeyer 
et al., 2005; Dar et al., 2006; Sugiyama et al., 2006; Zou 
et al., 1998). In addition, CXCR4 signaling promotes hematopoi- 
etic stem cell (HSC) quiescence, homing to bone marrow from 
blood and differentiation into committed myeloid progenitors 
(Broxmeyer et al., 2003a, 2003b; Kawai et al., 2007; Nie et al., 
2008; Sugiyama et al., 2006). 

Chromothripsis refers to multiple clustered genetic rearrange- 
ments and deletions affecting one or a few chromosomes (Ste- 
phens et al., 2011). The abnormalities are thought to occur all 
at once in a single cell, which then presumably either dies or ac- 
quires a growth advantage, depending on the genes affected 
(Stephens et al., 2011). Accordingly, chromothripsis was first 
identified by whole genome sequencing of cancer cell lines 
and has been reported to affect ~2% of all cancers (Jones and 
Jallepalli, 2012), as well as one patient with a severe congenital 
cognitive syndrome (Kloosterman et al., 2011). Criteria for chromothripsis 
include (1) clustering of breakpoints in limited areas of 
one or several chromosomes with large intervening regions of 
normal sequence, (2) copy number states that suddenly oscillate 
between areas of normal heterozygosity and loss of heterozy- 
gosity, and (3) rearrangements affecting a single haplotype 
with multiple fragments rearranged in random orientation and 
order (Korbel and Campbell, 2013). Here, we describe chromo- 
thriptic deletions of one copy of chromosome 2, including dele- 
tion of the disease allele CXCR4 R334X , in a patient with WHIM 
syndrome that resulted in cure of the disease. 

RESULTS 

Cure of a Patient with WHIM Syndrome 

The index patient, designated WHIM-09, is a white female who 
presented at age 58 to the NIH requesting evaluation for herself 
and two of her three daughters, designated WHIM-10 (age 21) 
and WHIM-11 (age 23) (Figure 1A). Both daughters had a history 
of recurrent infections since early childhood, multiple cutaneous 
warts, panleukopenia, and hypogammaglobulinemia, and there- 
fore fulfilled all the clinical criteria for WHIM syndrome. In 
contrast, WHIM-09 reported that from childhood through age 
38 she had had many serious infections, often requiring hospital- 
ization, but then none in the 20 subsequent years, and that she 
had had confluent warts on her hands that spontaneously 
resolved also in her 30s (Figure 1B). Moreover, we found that 
at the time of presentation, WHIM-09 was not neutropenic, but 
instead had a mild leukocytosis, including an absolute neutrophil 
count (ANC) and absolute monocyte count (AMC) that were ~2- 
fold greater than the upper limit of normal; in contrast, the abso- 
lute lymphocyte count (ALC) was within normal limits. The past 
medical history revealed that WHIM-09 was in fact the first pa- 
tient ever described with myelokathexis, the key hematopatho- 
logic feature in WHIM syndrome, reported in two articles pub- 
lished in The New England Journal of Medicine (NEJM) in 1964 
(Krill et al., 1964; Zuelzer, 1964). She underwent a therapeutic 
splenectomy at age 9 for the possibility of autoimmune neutro- 
penia, which was ineffective. There is no evidence that her par- 
ents or siblings had WHIM syndrome. Thus, the history and clin- 
ical evidence were compatible with a WHIM mutation occurring 
de novo in patient WHIM-09, autosomal dominant transmission 
to two of her three daughters, and spontaneous and durable 
complete clinical remission in WHIM-09 in her fourth decade of 
life (Figure 1A). 

To evaluate potential mechanisms for clinical remission, we 
first graphed all available white blood cell counts for WHIM-09, 
including those previously published in the NEJM (Figure 1C). 
Consistent with the clinical history, this revealed severe neutro- 
penia at least from age 9 that was unaffected by splenectomy 
but that began to resolve spontaneously early in the fourth 
decade of life, rising slowly over time to a new and stable baseline 
slightly above the upper limit of normal. The AMC followed the 
same time course, whereas, interestingly, the ALC did not, start- 
ing in the normal range for healthy individuals as a child then 
increasing inconsistently and only slightly as an adult. Neverthe- 
less, when lymphocyte subsets were examined in detail, all B cell 
subsets and both naive CD4+ and CD8+ T cell subsets were 
below the lower limit of normal (Table 1), as they were in both 
daughters and most other patients reported with WHIM syn- 
drome. Consistent with this, WHIM-09 was slightly hypogamma- 
globulinemic at the time of presentation to NIH with IgG = 
535 mg/dl (normal range, 642-1,730 mg/dl). In contrast, memory 
CD4+ and CD8+ T cell subsets were elevated in WHIM-09, but 
deficient in both daughters. Unfortunately, WHIM-09’s archival 
lymphocyte subset values from the years when she fulfilled the 
clinical criteria for WHIM syndrome were not available. 

We considered myeloid leukemia or a possible pre-leukemic 
state as a cause of her mild leukocytosis; however, the patient 
was clinically well over this ~20-year time span when her neutro- 
phils and monocytes were increasing, and her blood smear and 
bone marrow histopathology at the NIH were not consistent with 
these diagnoses (Figure 1D); moreover, specific testing for B and 
T cell clonality as well as for BCR-ABL and JAK mutations was 
negative (see Extended Experimental Procedures for details). 
Consistent with her apparent ~20-year complete remission of 
WHIM syndrome by clinical criteria, her bone marrow did not 
present the characteristic features of the disease (increased 
myeloid:erythroid ratio, neutrophil vacuolization, eyeglass nuclei 
in neutrophils), which were, however, all present in her bone 
marrow histopathology reported in the NEJM in 1964, shown 
again here for comparison, with permission, in Figure 1D
(Zuelzer, 1964). Since the patient reported she had undergone 
several prior surgeries and blood transfusions, we tested her 
blood for evidence of allogeneic chimerism and found none 
(data not shown). Thus, although the patient appeared to be clin- 
ically cured, she was hematologically mosaic, with sustained 
spontaneous correction of neutropenia, monocytopenia, and 
myelokathexis, and continued deficiency of B and naive T lym- 
phocytes in the blood. 

Patient WHIM-09 Is a Genetic Mosaic for CXCR4 R334X,
the Most Common Mutation Causing WHIM Syndrome 

We next genotyped whole blood DNA from family members using
an established PCR-BstUI restriction fragment length polymorphism
(PCR-RFLP) assay for the most common mutation
in WHIM syndrome, CXCR4 R334X. Both affected daughters
(WHIM-10 and WHIM-11) tested positive, whereas the unaf- 
fected husband and third unaffected daughter tested negative. 
Surprisingly, two independent whole blood samples from 

Figure 1. Long-Term Clinical Remission of WHIM Syndrome and Evidence for Somatic Mosaicism in Patient WHIM-09

(A) Family pedigree. Squares, males; circles, females; shaded symbols, WHIM syndrome; arrow, index patient WHIM-09. 

(B) Spontaneous and complete remission of warts in patient WHIM-09. According to patient WHIM-09, through her fourth decade of life she had had extensive 
warts on her hands, similar to her daughters (illustrated here for 24-year-old daughter WHIM-11), that spontaneously resolved.

(C) Spontaneous sustained correction of neutropenia and monocytopenia in patient WHIM-09. WBC, white blood cell count; ANC, absolute neutrophil count; 
AMC, absolute monocyte count; ALC, absolute lymphocyte count. Arrows indicate age at splenectomy; horizontal lines indicate normal range for each cell type. 
Note, x axis is discontinuous to show pre/post splenectomy results more clearly. 

WHIM-09, in which the leukocyte content was composed mostly
of neutrophils and monocytes (Figures 1E and 1F), also tested
negative for the mutation. In contrast, DNA from WHIM-09 
PBMCs, which were composed mostly of lymphocytes, as well 
as from a lymphoblastoid cell line generated from WHIM-09 
PBMCs both tested positive for CXCR4 R334X (Figure 1E). The
mutation was also not detectable by direct sequencing of whole 
blood cell DNA from WHIM-09, whereas whole blood cell sam- 
ples from both daughters, WHIM-10 and WHIM-11, were both 
positive (Figure 1F). In contrast, DNA samples from a cheek
swab and fibroblasts cultured from a skin biopsy from WHIM-09
were both heterozygous for CXCR4 R334X (Figures 1E and
1G), defining her as a somatic genetic mosaic. A WHIM
pedigree with germ-line/somatic genetic mosaicism, where the 
mother of two affected children was hematologically normal, 
has previously been described (Hernandez et al., 2003), but 
was unlikely to apply to WHIM-09 since she had been markedly 
symptomatic with severe neutropenia and myelokathexis as a 
child and cutaneous warts as an adult (Krill et al., 1964; Zuelzer, 
1964). Therefore, we investigated whether a genetic reversion 
had occurred. 

Chromothripsis as the Mechanism for Genetic 
Mosaicism in Patient WHIM-09 

The PCR-RFLP assay and DNA sequencing results may both be 
explained by either reversion or deletion in patient leukocytes of 
the mutant nucleotide that defines the CXCR4 R334X allele. To 
address this, we first performed cytogenetic analysis of bone 
marrow cells. In all 20 metaphase cells examined, one copy of 
chromosome 2, where CXCR4 is located, was acrocentric rather 
than submetacentric and was significantly shorter than its 
normal homolog (Figure 2A). All other chromosomes appeared 
normal. In addition to deletions, the banding pattern of the 
abnormal chromosome 2 suggested the presence of inversions. 
Fluorescence in situ hybridization (FISH) revealed that the 
anaplastic lymphoma receptor tyrosine kinase (ALK) gene, at 
2p23 on the normal chromosome 2, was on the long arm of the 
abnormal chromosome 2 (Figure 2B). FISH also demonstrated 
that the abnormal chromosome 2 had a portion of the centro- 
mere inverted into the long arm and the N-myc gene (MYCN), 
normally on 2p24, was absent (Figure 2C). Almost all of the inter- 
phase bone marrow cells with polymorphic nuclei, consistent 
with neutrophils, had the abnormal hybridization patterns of 
the abnormal chromosome 2 (Figure 2C), whereas cells with 
round nuclei had a significantly lower percentage of the 
abnormal chromosome (data not shown). An immunohistochemical
stain of ALK activity on a bone marrow core biopsy section
did not reveal abnormal activation of the enzyme despite its
abnormal chromosomal location (Figure S1).

To define the chromosomal abnormalities with greater reso- 
lution, microarray was performed using patient bone marrow 
cell and cultured fibroblast DNA. This revealed that the 
abnormal chromosome 2 was ~35 Mb shorter than usual due 
to seven large deletions (Figure S2A), as demonstrated by 
loss of heterozygosity in these regions and by an abrupt 
change in the relative copy number from 2 to 1. One of the deletions
involved the MYCN gene, confirming the FISH results. In
addition, one of the deletions included the position of the 
CXCR4 gene (Figure S2B). This had made the patient hemizy- 
gous for CXCR4 in cells having the abnormal chromosome 2 
with loss of the CXCR4 R334X mutation as well as loss of one 
copy of 163 other annotated genes (Table S1). Thus, development
of hemizygosity of CXCR4, with only the wild-type copy
remaining, explained why the CXCR4 R334X mutation was not 
detectable in neutrophil DNA by either DNA sequencing or 
PCR-RFLP. However, this potentially fortuitous deletion was 
clearly only a small part of a much larger and more complex 
genetic event. 
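
As an illustration of how an abrupt copy-number change from 2 to 1 can be turned into a list of candidate deletions, the following is a minimal sketch. It is not the analysis pipeline used in this study: the function name `deleted_segments`, the probe representation, and the toy values are all assumptions made for the example, and real microarray data would require noise handling and segmentation that are omitted here.

```python
from typing import List, Tuple


def deleted_segments(probes: List[Tuple[float, int]]) -> List[Tuple[float, float]]:
    """Return (start_mb, end_mb) for each contiguous run of copy number 1.

    `probes` is a hypothetical list of (position_mb, copy_number) pairs,
    assumed to be sorted by position along one chromosome.
    """
    segments = []
    run_start = None  # position of the first probe in the current run
    prev_pos = None   # position of the most recent copy-number-1 probe
    for pos, copy_number in probes:
        if copy_number == 1:
            if run_start is None:
                run_start = pos
            prev_pos = pos
        elif run_start is not None:
            # Copy number returned to 2 (or another value): close the run.
            segments.append((run_start, prev_pos))
            run_start = None
    if run_start is not None:
        # The run extends to the last probe.
        segments.append((run_start, prev_pos))
    return segments


# Toy values for illustration only; these are NOT patient data.
toy_probes = [(10.0, 2), (11.0, 2), (12.0, 1), (13.0, 1),
              (14.0, 2), (15.0, 1), (16.0, 2)]
toy_segments = deleted_segments(toy_probes)
```

On the toy input, the function reports two runs of single-copy probes. In practice each run would then be checked against loss of heterozygosity before being called a deletion, as described above.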

We next performed whole genome sequencing (WGS) 
for single nucleotide definition of the event, comparing 
blood neutrophil and skin fibroblast DNA from WHIM-09 
to each other and to the standard human genome sequence 
in GenBank (version hg19). We obtained an average of 40x 
coverage of both samples (database of Genotypes and
Phenotypes [dbGaP] accession number phs000856.v1.p1
[http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000856.v1.p1]),
and an analysis was performed
to locate the DNA reads that had homology with multiple 
distinct areas of chromosome 2. This technique allowed pre- 
cise base pair level identification of the inversions and deletions 
that had occurred on one copy of chromosome 2 (Table S2). 
No other large deletions or inversions were detected in the 
genome. 

The changes seen in the microarray were confirmed by WGS 
but were actually more complex than had been initially sus- 
pected. The derivative chromosome was composed of 18 re- 
maining pieces arranged in random orientation and in a 
random order, characteristic of chromothripsis (Figure 3A). A 
circular plot of the connections revealed by paired end WGS 
of the neutrophil DNA is shown in Figure 3B. The abnormal de- 
rivative chromosome, modeled using the breakpoints and con- 
nections (Figure 3C), revealed a predicted structural banding 
pattern that was identical to what was observed by cytoge- 
netic analysis. We also developed primer pairs spanning four 



(D) Normalization of bone marrow pathology in patient WHIM-09. A representative high magnification (500x) Wright-Giemsa stain of the bone marrow aspirate is 
shown for the index patient WHIM-09 in 1963 (Zuelzer, 1964) (reproduced with permission) and in 2013 at ages 9 and 59, respectively. Arrows in left image, 
eyeglass nuclei in neutrophils. 

(E) PCR-BstUI restriction fragment length polymorphism (BstUI) analysis of genomic DNA. HD, healthy donor; WHIM-09, index patient; Cheek, cheek swab cells;
PBMC, peripheral blood mononuclear cells; Fb, fibroblast; LCL, lymphoblastoid cell line; WB, whole blood; WT, wild-type allele; WHIM, CXCR4 R334X allele that 
causes WHIM syndrome. 

(F) Sanger DNA sequencing analysis of whole blood DNA for affected family members in the region near nucleotide position 1,000 (vertical line), the site of WHIM
mutation CXCR4 R334X (1,000 C -> T). Blue, C; Red, T; Fb, fibroblast; PMN, polymorphonuclear leukocyte.

(G) Sanger DNA sequencing analysis of DNA from purified peripheral blood neutrophils, cells obtained from a buccal swab (cheek) and cultured skin fibroblasts for 
patient WHIM-09 in the same region as (F). 

Table 1. Distribution of Leukocyte Subsets in the Peripheral Blood of Index Patient WHIM-09 in Clinical Remission from WHIM Syndrome and Her Two Affected Daughters, WHIM-10 and WHIM-11

Leukocyte Subset                                WHIM-09    WHIM-10    WHIM-11    Normal Adult Reference Range c
Neutrophil                                      9,980 a    380 b      530 b      1,560-6,130
CD3+ T cell                                     1,736      382 b      218 b      714-2,266
CD4+ T cell                                     1,389      317 b      155 b      359-1,565
CD4+CD45RA+ naive T cell                        211 b      44 b       12 b       454-733
CD4+CD45RO+ memory T cell                       1,178 a    273        143 b      219-1,048
CD8+ T cell                                     222        41 b       45 b       178-853
CD8+CD45RA+ naive T cell                        44 b       22 b       14 b       231-371
CD8+CD45RO+ memory T cell                       178 a      19 b       31 b       57-130
CD3−CD56+ NK cell                               1,270 a    32 b       51 b       126-729
CD19+ B cell                                    18 b       5 b        10 b       61-329
CD19+CD27+ memory B cell                        2 b        1 b        1 b        12-68
CD19+CD27− B cell                               16 b       4 b        9 b        90-176
CD19+CD27−IgD+IgM+ transitional/naive B cell    12 b       3 b        4 b        42-85
CD19+CD27−IgD−IgM+ immature B cell              4          1 b        1 b        2-10
CD14+CD16− classical monocyte                   1,344 a    35 b       117 b      371-539
CD14+CD16+ inflammatory monocyte                235 a      5 b        23         14-30

Data are presented as absolute numbers of cells having the indicated
immunophenotype per microliter of whole blood.
a Value above the upper limit of normal.
b Value below the lower limit of normal.
c Based on the values of 11-40 healthy blood donors seen at the NIH
Clinical Center.



of the unique chromothriptic boundaries and demonstrated 
experimentally that they generated the predicted rearranged 
product from WHIM-09 neutrophil DNA but not from healthy 
donor DNA (data not shown). By analyzing the rearrangement 
breakpoints for the creation of novel fusion genes, we found 
two possibilities, fusions of MFSD2B with LOC285000 and 
POTEKP with NCKAP5 (Figure S3); however, these genes are 
unlikely to be expressed or functional because each lacks a 
promoter and transcription initiation site, and major regions 
of the potentially fused genes are deleted. Thus, the most likely 
explanation for the patient’s clinical improvement was haploin- 
sufficiency for one or more of the 164 genes affected by 
chromothripsis. 



Myeloid/Lymphoid Mosaicism Established by 
Chromothripsis in an HSC from Patient WHIM-09 

Chromothripsis-specific primers did not generate a detectable 
product by PCR when DNA prepared from the archival spleen 
sample from patient WHIM-09 was amplified, suggesting that 
the chromothriptic event occurred after age 9, when splenec- 
tomy was performed, but before age 35, when the rise in 
WBC was already underway (Figure 4A). The emergence of 
uniformly chromothriptic neutrophils, as suggested by WGS, 
PCR, and bone marrow cytogenetics and FISH, implied that 
at least some patient HSCs must also be chromothriptic. We 
verified this with CD34+ leukocytes cultured from peripheral
blood, as well as with CD38−CD90+CD45RA− HSCs FACS-sorted
from patient bone marrow (Figures 4B, 4C, and S4),
both of which appeared to be markedly deficient in or to lack the
CXCR4 R334X allele by the BstUI PCR-RFLP assay. Consistent
with this, CD38+CD135+CD45RA− common myeloid precursor
cells (CMP) and CD38+CD135+CD45RA+ granulocyte-monocyte
precursor cells (GMP) sorted from patient bone marrow
also lacked the CXCR4 R334X allele by the PCR-RFLP assay,
whereas FACS-sorted CD45+CD34+CD38+CD10+ common
lymphoid precursors (CLP) remained CXCR4+/R334X (Figure 4D,
lower panel). This pattern was confirmed by PCR using chro- 
mothripsis-specific primers and DNA from HSCs, CMPs, and 
CLPs (Figure 4D, upper panel). Consistent with this, we found 
that all mature myeloid cell types tested in both blood and 
bone marrow lacked or were markedly deficient in the 
CXCR4 R334X WHIM allele, whereas all mature lymphoid cell 
types tested in peripheral blood and bone marrow were heterozygous
CXCR4+/R334X (Figures 4B and 4C). This aligns
with the cytogenetic results presented in Figure 2C. Further, 
we found 100% of EBV-transformed B cell lines (n = 10) pre- 
pared from patient blood lacked the chromothriptic chromo- 
some (data not shown). As expected, erythroid precursors, 
which derive from CMPs, also lacked the CXCR4 R334X 
allele as demonstrated by BstUI PCR-RFLP analysis of Burst
Forming Unit-Erythroid (BFU-E) colonies generated from patient
CD34+ cells cultured from PBMCs ex vivo (Figure 4E).
Thus, the combined evidence suggests that an HSC under- 
went chromothripsis and selectively repopulated the myeloid 
lineage (including the erythroid lineage), but not the lymphoid 
lineage (Figure 4F). 

Cxcr4 Haploinsufficiency Enhances Engraftment during 
Bone Marrow Transplantation 

To test whether deletion of the disease allele CXCR4 R334X
alone, and not any of the other 163 genes deleted by chromothripsis,
would be sufficient to confer a selective advantage to
an HSC in the context of WHIM syndrome, we performed
competitive repopulation experiments using a mouse model
of WHIM syndrome (Cxcr4+/S338X), hemizygous Cxcr4 mice
(Cxcr4+/o, the same as the HSC and myeloid cell CXCR4 genotype
of patient WHIM-09), and wild-type (Cxcr4+/+) mice as
donors and recipients. Leukocytes from these strains were
specifically marked with either CD45.1 or CD45.2 or both, allowing
tracking of transplanted donor cell fate in vivo by
FACS. When ~2.5 × 10^6 total donor bone marrow cells from
both Cxcr4+/S338X and Cxcr4+/o mice were mixed together in

Figure 2. Massive Deletion and Rearrangement of
One Copy of Chromosome 2, the Location of 
CXCR4, in Patient WHIM-09 

Cytogenetics and fluorescence in situ hybridization (FISH) 
analyses were performed on cultured bone marrow cells. 

(A) Cytogenetics. Karyogram showing a short acrocentric 
chromosome 2 (arrow) with abnormal banding pattern 
suggesting deletions and inversions observed in all 20 
metaphase cells analyzed from bone marrow. 

(B) FISH with intact ALK break apart probe (Abbott 
Molecular) signal at 2p23 on normal chromosome 2 and on 
long arm of abnormal chromosome 2 (der 2). 

(C) FISH with MYCN and CEP 2 probe set (Abbott Molec- 
ular). Left: metaphase cell showing the normal chromo- 
some 2 with intact centromere signal (red) and MYCN 
signal (green) at 2p24. The abnormal chromosome 2 has a 
portion of its centromere inverted into the long arm splitting 
the red signal; the green MYCN signal is absent. Right: 
polymorphonuclear and round interphase nuclei showing 
the abnormal hybridization pattern with one green MYCN 
signal and three red CEP 2 signals. One of the red signals is 
smaller than the other two and is close to one of them. See 
also Figure S1.

Figure 3. Chromothripsis as the Mechanism for Loss of CXCR4 R334X in Patient WHIM-09

Purified neutrophil and cultured skin fibroblast DNA from patient WHIM-09 was isolated and subjected to whole genome sequencing with paired-end analysis. 

(A) Linear non-proportional plot of the abnormal copy of chromosome 2 in patient WHIM-09 labeled from the p arm telomere (0) to the q arm telomere (243) in 
megabases with the 18 remaining pieces arranged in their numeric order (top). Connections between these pieces are depicted by the curved lines. Note that 
some connections were poorly defined because of the involvement of repetitive centromeric sequence (black lines). The order and orientation of the 18 remaining
pieces in the derivative chromosome are indicated at the bottom. 

(B) Circos plot of chromosome 2 and its normal Giemsa cytogenetic banding pattern labeled from the p arm telomere (0) to the q arm telomere (~240) in
megabases. Large pieces of chromosome 2 were missing from patient WHIM-09 neutrophil DNA and the 18 remaining pieces were arranged in random order.
Connections between these pieces and their orientation (inset) are depicted by the colored lines. See Figure S2 and Tables S1 and S2 for additional details. Note
that two connections were poorly defined because of the involvement of repetitive centromeric sequence. The inner circular trace is the copy number variation
data derived from microarray analysis (Figures S2A and S2B). Note that the sites of connections derived from the paired-end sequencing analysis closely match
the sites where copy number variation abruptly falls from 2 to 1. The location of CXCR4 is indicated at lower left.

(C) Derivative chromothriptic chromosome 2. Ideogram of intact chromosome 2 (left) and a model of the Giemsa cytogenetic banding pattern and the order and 
orientation of the 18 pieces with the deletions called by microarray shaded in yellow (right) are shown. The resultant remaining chromosome 2 banding pattern 
predicted by whole genome sequencing closely matches that seen by cytogenetic analysis. Note the location of CXCR4 at 2q22.1 in one of the deleted segments. 
See also Figure S3. 

Figure 4. Chromothriptic CXCR4-Haploinsufficient HSC Replacement of the Myeloid Lineage, but Not the Lymphoid Lineage, Is Associated
with Clinical Remission in Patient WHIM-09 

(A-E) Representative results from a BstUI PCR-restriction fragment length polymorphism assay (BstUI), designed to distinguish the wild-type CXCR4 allele (WT)
from the CXCR4 R334X WHIM allele (WHIM), as well as from a PCR assay specific for the chromothriptic chromosome (13-16 Jxn). PCR was performed on DNA
obtained from the indicated donor leukocyte subsets purified either from blood using magnetic bead purification (B and E) or from a bone marrow aspirate using
flow cytometric sorting (C and D). DNA was also prepared from archived WHIM-09 spleen and compared with peripheral blood PMN DNA (A), as well as from
Burst-forming Unit-Erythroid colonies and compared with blood leukocyte subsets (E). WHIM-09, index patient; HD, healthy donor; Spl, spleen; PMN, polymorphonuclear
leukocytes; 13-16 Jxn, PCR product specific for the chromothriptic junction between segments 13 and 16 of the chromothriptic chromosome of
patient WHIM-09; PBMC, peripheral blood mononuclear cells; CD4, purified CD4+ T cells; CD8, purified CD8+ T cells; CD19, purified CD19+ B cells; CD56, purified
CD56+ natural killer cells; CD14, purified CD14+ monocytes; CD34, purified CD34+ hematopoietic cells; CD3, purified CD3+ T cells; CD15, purified CD15+
neutrophils; CD16, purified CD16+ neutrophils; HSC, hematopoietic stem cells; CLP, common lymphoid precursor; CMP, common myeloid precursor;
GMP, granulocyte/monocyte precursor; MEP, megakaryocyte-erythroid precursor; BFU-E, Burst-forming Unit-Erythroid; CXCR4, CXCR4 amplicon not digested
with BstUI.

(F) Summary of myeloid/lymphoid mosaicism for CXCR4 R334X in patient WHIM-09. The immunophenotype used to purify each cell type from enriched 
CD34+CD45+ cells is summarized next to each cell type shown. Red, negative for CXCR4 R334X; green, positive for CXCR4 R334X; asterisks, purified cell types
directly analyzed by PCR-RFLP for the WHIM mutation. 

See also Figure S4. 



equal proportions (~50%:50%) and transplanted into lethally
irradiated Cxcr4+/+ recipients, by day 105 post-transplantation
the percentage of Cxcr4+/S338X neutrophils, monocytes, and B
cells detectable in the peripheral blood had declined to ~5%,
whereas the percentage of each of the corresponding Cxcr4+/o
leukocyte subsets had increased to ~95% (Figure 5A). The
proportion of donor-derived blood cells diverged from the
input ratio of 50:50 in two phases, a rapid early phase within
2 weeks and a slow late phase evident by 2 weeks after
transplantation. The inflection point could be earlier than
2 weeks, since this is the first time point when data were
collected to allow for recovery after transplantation.
Importantly, there was no difference
in the baseline HSC frequency in bone marrow for Cxcr4+/o



Cell 160, 686-699, February 12, 2015 ©2015 Elsevier Inc. 693 










[Figure 5 graphic. Panels (A) and (B) each show (i) the donor and recipient design with CD45.1 and CD45.2 congenic markers and lethal irradiation of the recipient, (ii) flow cytometry plots of donor cells and of donor-derived blood cells at day 105 (A) or day 303 (B), and (iii) plots for CD45+, Ly6G+, CD11b+, CD19+, and CD3+ cells against days post bone marrow transfer.]



Figure 5. Cxcr4 Haploinsufficiency Enhances HSC Engraftment during Mouse Bone Marrow Transplantation 

Two types of competitive bone marrow transplantation experiments were performed: (A) Cxcr4+/o versus Cxcr4+/S338X (mouse model of WHIM syndrome), and (B)
Cxcr4+/o versus Cxcr4+/+. (i) Experimental design. (ii) Representative flow cytometry plots demonstrating the relative contributions of CD45 congenic markers in
mixed donor bone marrow prior to transplantation (left panel) and in blood after bone marrow transplantation (right panel) in a single mouse. (iii) Cell frequency
data for the leukocyte subsets indicated at the top of each panel, presented as the mean ± SEM percentage (%) of total donor-derived cells for each subset (n =
10 mice per data point). SEM was <5% of the mean in all cases and therefore is not visible for most data points. Results were verified in one and two additional
independent experiments for (A) and (B), respectively.

See also Figure S5. 



and Cxcr4+/S338X donor mice, so the number of HSCs transplanted
was similar (Figure S5). Since the ANC in the
peripheral blood of Cxcr4+/S338X donor mice is only ~50%
less than the corresponding value in both Cxcr4+/+ mice (Balabanian
et al., 2012; J.-L.G., M.S., E.Y., and P.M.M., data not
shown) and Cxcr4+/o mice (J.-L.G., unpublished data), the
extreme skewing of these transplantation results in the blood
suggested that hemizygous Cxcr4 (Cxcr4+/o) HSCs may have
a selective advantage over WHIM (Cxcr4+/S338X) HSCs for
engraftment in this system. Consistent with this, in the bone
marrow, as in the blood, we observed a greater content of
mature Cxcr4+/o leukocytes compared to mature Cxcr4+/S338X
leukocytes (data not shown). No difference was observed for
the frequencies of Cxcr4+/o and Cxcr4+/S338X T cells in the
blood, presumably owing to homeostatic proliferation of
mature T cells present in donor bone marrow transferred into
irradiated hosts.

To test whether there might be a general selective advantage
of Cxcr4 hemizygosity in transplantation, we performed competitive
repopulation experiments by mixing ~2.5 × 10^6 total donor
bone marrow cells each from Cxcr4+/+ and Cxcr4+/o mice and
transplanting the mixture into lethally irradiated Cxcr4+/+ recipients.
The baseline HSC frequency in bone marrow was the
same for Cxcr4+/o and Cxcr4+/+ donor mice (Figure S5). In this
case, the input ratio was skewed slightly in favor of the
Cxcr4+/+ cells over the Cxcr4+/o cells (58%:42%). Nevertheless,
by day 303 post-transplantation, when the animals were sacrificed,
the percentage of Cxcr4+/+ neutrophils, monocytes, and
B cells detectable in the peripheral blood had declined to less
than ~15%, whereas the percentage of each of the corresponding
Cxcr4+/o leukocyte subsets had increased to ~85% with
similar kinetics as for the competition with Cxcr4+/S338X bone
marrow (Figure 5B). The same effect was observed whether
the irradiated recipient mouse was CD45.1, CD45.2, or Cxcr4+/o
(data not shown) and whether or not the donor bone marrows were
depleted of lineage-positive cells (Figure S6).
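
To compare the two competitive transplantation experiments on a common scale, one option is to express each outcome as the change in the odds of a blood cell being Cxcr4+/o-derived between input and endpoint. The sketch below uses only the approximate percentages quoted in the text (50% to ~95%, and 42% to ~85%). The odds-based measure and the function names `odds` and `odds_shift` are assumptions introduced for illustration; they are not a statistic reported in this study.

```python
def odds(fraction: float) -> float:
    """Convert a fraction (strictly between 0 and 1) into odds."""
    if not 0.0 < fraction < 1.0:
        raise ValueError("fraction must be strictly between 0 and 1")
    return fraction / (1.0 - fraction)


def odds_shift(input_fraction: float, output_fraction: float) -> float:
    """Fold change in the odds of the tracked genotype from input to output."""
    return odds(output_fraction) / odds(input_fraction)


# Cxcr4+/o versus Cxcr4+/S338X: ~50% of input, ~95% of blood cells at day 105.
shift_vs_whim = odds_shift(0.50, 0.95)   # roughly a 19-fold shift in odds

# Cxcr4+/o versus Cxcr4+/+: 42% of input, ~85% of blood cells at day 303.
shift_vs_wild_type = odds_shift(0.42, 0.85)   # roughly a 7.8-fold shift in odds
```

Because the reported endpoint percentages are approximate, these fold changes are order-of-magnitude summaries only. They indicate that the Cxcr4+/o advantage is large against both competitors and appears somewhat larger against the WHIM model genotype.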

To investigate the mechanism for the apparent competitive
advantage of Cxcr4+/o over Cxcr4+/+ bone marrow cells for reconstituting
the blood, we first measured the proliferative status
of the corresponding HSCs in vivo by BrdU incorporation early
after transplantation (day 7) (Figure 6A). The results indicated
a ~20% greater frequency of BrdU+ Cxcr4+/o HSCs as compared
to Cxcr4+/+ HSCs, suggesting a proliferative advantage of
Cxcr4+/o HSCs in the bone marrow. To test whether differential
retention of leukocytes in the bone marrow might also
contribute to the skewed distribution of mature leukocytes in
the blood, we sacrificed mice at day 303 after competitive
transplantation with Cxcr4+/o and Cxcr4+/+ bone marrow cells
(the same combination of donor bone marrow cells as were
analyzed in the proliferation experiments). The results showed
that Cxcr4+/o HSCs, HPCs (hematopoietic progenitor cells),
and total CD45+ cells (mostly mature leukocytes) predominated
in the bone marrow by the same 4:1 ratio over the corresponding
Cxcr4+/+ cells as was found for mature leukocytes in the
blood. Thus, the predominance of mature Cxcr4+/o over
Cxcr4+/+ leukocytes in the blood of competitively transplanted
mice cannot be simply explained by low retention of mature
Cxcr4+/o leukocytes relative to the retention of mature Cxcr4+/+
leukocytes in the bone marrow. Moreover, the results at this
very late time point clarify that enhanced Cxcr4+/o HSC proliferation
does not result in long-term depletion of HSCs. The frequency
distribution of stem and progenitor subtypes was the
same for each genotype, indicating no block to progenitor
cell differentiation (Figure 6B).

Finally, when Cxcr4+/o bone marrow was used in competitive
repopulation assays with either Cxcr4+/+ or Cxcr4+/S338X,
we found in both cases that the total white blood cell and differential
counts in the blood post-transplantation were not
significantly altered as compared to reference values for
C57BL/6J mice (http://phenome.jax.org), suggesting that the
Cxcr4+/o genotype does not predispose to myelodysplasia or
leukemia in the setting of this transplantation model (Figure S7).
Moreover, there is no evidence that Cxcr4+/o mice
have a constitutive predisposition to myelodysplasia (data
not shown).

DISCUSSION 

Our analysis of this unprecedented experiment of nature in a
single patient provides the first evidence that chromothripsis,
a newly described form of genetic instability, may result in clinical
benefit, in particular, cure of an inherited disease. This patient
also provides the second example of chromothripsis
occurring in a stem cell still capable of terminal differentiation
(Kloosterman et al., 2011), as well as one of the few examples of
spontaneous genetic cure of a Mendelian condition (Hirschhorn,
2003).

We use the word “cure” to refer to the patient’s clinical 
outcome because she had met all four of the acronymic clinical 
criteria diagnostic for WHIM syndrome through her fourth 
decade of life, but has fulfilled none except for mild hypogam- 
maglobulinemia since then (~20 years, to date). Moreover, the 
disease allele CXCR4 R334X is no longer detectable in key cell 
types that drive the disease: HSCs, neutrophils, and mono- 
cytes. To our knowledge, patient WHIM-09’s spontaneous 
long-term complete remission of warts without treatment is un- 
precedented in WHIM syndrome (Al Ustwani et al., 2014; Beau- 
ssant Cohen et al., 2012; Kawai and Malech, 2009; Tassone 
et al., 2009; Wetzler et al., 1990). Since she remains lympho- 
penic (B and naive T lymphocytes), this suggests that the 
myeloid arm of the immune system, probably through mono- 
cytes or monocyte-derived cells, plays an essential role in 
HPV clearance. 

The combined evidence suggests that an HSC in WHIM-09 
underwent chromothripsis between the second and fourth 
decade of life and selectively repopulated the myeloid lineage, 
but not the lymphoid lineage. This pattern of hematologic 
mosaicism implies that the chromothriptic changes precluded 
differentiation of HSCs to CLPs but not to CMPs. Thus, the 
mechanism for maintenance of the B cell lineage, which is 
WHIM in the patient, is unclear. Possibilities include differenti- 
ation from a small population of persistent WHIM HSCs below 
the level of detection of our assays, or by differentiation of 
self-sustaining or very long-lived WHIM CLPs. In contrast, 
the T cell lineage may simply have been maintained by 
homeostatic proliferation of the pre-chromothriptic WHIM 
T cell repertoire. 

Chromothripsis is a complex chromosomal catastrophe that is 
thought to occur all at once in one cell (Stephens et al., 2011). If 
the affected cell dies, chromothripsis is clinically silent and undetectable;
therefore, the true frequency with which it occurs
cannot be determined. If the chromothriptic cell acquires a
strong selective advantage, it may emerge as a readily detect- 
able, clinically apparent clonal population harboring a single 

Figure 6. Cxcr4 Haploinsufficiency Enhances HSC Proliferation and Engraftment in a Mouse Bone Marrow Transplantation Model

(A) Proliferation. Donor bone marrow cells with a Cxcr4+/o genotype on a homozygous CD45.2 background were mixed with donor bone marrow cells with a
Cxcr4+/+ genotype on a heterozygous CD45.1/CD45.2 background (47:53) and injected intravenously into a lethally irradiated CD45.1 recipient mouse. Six days
after bone marrow transplantation, each mouse was given 1.25 mg of BrdU intraperitoneally (i.p.). Twenty hours later, the mice were euthanized for HSC proliferation
analysis. (i) Gating scheme for BrdU+ HSCs. Bone marrow cells were first gated with CD45.2 (Cxcr4+/o) and CD45.1/CD45.2 (Cxcr4+/+), then HSCs were
gated as Flt3−Lin−Sca1+c-Kit+ (Flt3−LSK), which includes long-term and short-term HSCs, and BrdU+ cells were quantitated. (ii) Percentage of BrdU+ HSCs in
each donor. Data are expressed as mean ± SEM from four mice. The experiment was repeated once with similar results.

(B) Long-term engraftment and differentiation. Bone marrow cells from donor mice with a Cxcr4+/o genotype on a CD45.2 background were mixed with bone
marrow cells from donors with a Cxcr4+/+ genotype on a CD45.1 background (42:58) and the mixed cells were injected intravenously into lethally irradiated
recipient mice. Bone marrow was harvested 303 days later. (i) Gating scheme for long-term HSC (LT-HSC: CD34−Flt3−Lin−Sca1+c-Kit+), short-term HSC
(ST-HSC: CD34+Flt3−Lin−Sca1+c-Kit+), multipotent progenitors (MPP: CD34+Flt3+Lin−Sca1+c-Kit+), and common lymphoid progenitors (CLP: IL7ra+Lin−
Sca1low c-Kitlow). (ii) Long-term engraftment. (iii) Differentiation. The distribution frequency of bone marrow cell subsets is similar for Cxcr4+/o compared to
Cxcr4+/+ donor-derived cell populations. Each data point represents ten mice presented as the mean ± SEM.

See also Figures S6 and S7. 
pattern of deletions and rearrangements, resulting for example 
either in cancer, as previously reported (Jones and Jallepalli, 
2012), or if the location of the event is fortuitous, in cure of a genetic 
condition, as occurred in patient WHIM-09. Although it is 
possible that deletion of the disease allele CXCR4 R334X in the 
original chromothriptic HSC of patient WHIM-09, which rendered 
the cell CXCR4 haploinsufficient (CXCR4+/o), may have been sufficient 
to repopulate the myeloid lineage, as suggested by the 
strong HSC engraftment advantage of Cxcr4 haploinsufficient 
mouse bone marrow cells in our competitive bone marrow transplantation 
experiments, other factors, particularly haploinsufficiency 
for one or more of the 163 other genes that were deleted 
by chromothripsis, may also have contributed. In this regard, at 
least three of the other 163 deleted genes, DNMT3A (Challen 
et al., 2012), MYCN (Laurenti et al., 2008), and IL1R (Orelio 
et al., 2009), have been reported to regulate hematopoiesis 
(Table S1). 

That Cxcr4 haploinsufficiency should provide a competitive 
engraftment advantage to HSCs is somewhat counterintuitive 
since Cxcr4 has been well-documented to promote HSC hom- 
ing to and retention in bone marrow (Broxmeyer et al., 2003a, 
2003b, 2005; Dar et al., 2006; Kawai et al., 2007; Nie et al., 
2008; Sugiyama et al., 2006). However, it has also been re- 
ported that CXCL12 secreted by CXCL12-abundant reticular 
(CAR) cells that contact HSCs in the bone marrow niche is 
able to enhance HSC quiescence by stimulating CXCR4 
signaling (Nie et al., 2008; Sugiyama et al., 2006). The overall 
effect on engraftment of reducing signaling via CXCR4 haploin- 
sufficiency has not been previously tested in vivo and cannot 
be confidently predicted given the diverse and contrasting 
roles of the receptor on HSCs as well as on mature leukocytes. 
Our mouse data confirmed in vivo that Cxcr4 haploinsufficiency 
enhanced HSC proliferation while maintaining long-term 
hematopoiesis. A competitive advantage 
acquired from enhanced proliferation, and potentially from other 
mechanisms, may therefore supersede any potential 
disadvantage these cells have at the level of bone marrow retention 
and homing. 

Additional work using more stringent engraftment protocols in 
mice and ultimately gene editing/transplantation trials in humans 
will be needed to precisely determine the impact of CXCR4 hap- 
loinsufficiency on HSC engraftment as well as on each of the pa- 
rameters that regulate the physiologic steady-state levels of 
mature leukocytes and hematopoietic stem and progenitor cells 
in blood, bone marrow, and other hematopoietic compartments. 
Such studies may further elucidate precisely how patient 
WHIM-09 was cured, which may point to general applications 
in transplantation and gene therapy. 

EXPERIMENTAL PROCEDURES 
Patients 

Consistent with the Declaration of Helsinki, all human subjects signed 
informed consent to participate in NIAID Institutional Review Board-approved 
clinical protocols. All subjects were studied at the NIH Clinical Center. 

Human Leukocyte Analysis 

Leukocyte subsets were isolated from blood and bone marrow. PBMCs 
and bone marrow cells were cultured ex vivo to expand CD34+ hematopoietic 
progenitor cells and to generate BFU-E colonies. PBMCs were used to create 
lymphoblastoid cell lines (LCL) from Epstein-Barr virus (EBV)-transformed B 
cells. 

Genetic Analysis 

BstUI PCR-RFLP analysis was performed as previously described (Hernandez 
et al., 2003) using patient genomic DNA. The genome was also analyzed by 
PCR using primers spanning chromothriptic junctions of the derivative chro- 
mosome 2 of patient WHIM-09; by Sanger method sequencing; by standard 
cytogenetics and fluorescence in situ hybridization (FISH) of metaphase 
chromosomes (Quest Diagnostics) using Abbott Molecular probes (Abbott 
Molecular); by microarray analysis using the Affymetrix Cytoscan HD array 
(Affymetrix); and by whole genome sequencing (WGS). For WGS, paired-end 
patient WHIM-09 fibroblast and neutrophil DNA libraries were prepared using 
the TruSeq DNA protocol (Illumina) and were sequenced using a HiSeq 2000 
next generation sequencer (Illumina) generating ~380 million reads per library 
of 2 × 100 bp paired-ends. Reads were mapped to build hg19 of the human 
genome to identify fusion junctions present in the neutrophil but not in the 
fibroblast samples. 
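
As a rough sanity check on the sequencing figures above, the expected mean depth can be estimated as total sequenced bases divided by genome size. The sketch below is illustrative only: the genome size of about 3.1 Gb is an assumed round number that does not come from the text, and because the text does not say whether "~380 million reads" counts individual reads or read pairs, both readings are computed.

```python
# Back-of-the-envelope mean sequencing depth: total bases / genome size.
# ASSUMPTION: ~3.1e9 bp is a round figure for the human genome, not a value from the text.
ASSUMED_GENOME_SIZE_BP = 3.1e9


def mean_depth(n_reads, read_length_bp, genome_size_bp=ASSUMED_GENOME_SIZE_BP):
    """Return the expected mean depth, ignoring unmapped and duplicate reads."""
    return n_reads * read_length_bp / genome_size_bp


# Reading 1: 380 million individual 100 bp reads per library.
depth_if_reads = mean_depth(380e6, 100)

# Reading 2: 380 million read pairs, i.e. twice as many 100 bp reads.
depth_if_pairs = mean_depth(2 * 380e6, 100)

print(f"depth if reads: {depth_if_reads:.1f}x, depth if pairs: {depth_if_pairs:.1f}x")
```

Either reading gives only an upper bound, since reads that fail to map or are duplicates do not contribute to usable depth.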

Transplantation Experiments 

Cxcr4 floxed mice (Strain 008767, B6.129P2-Cxcr4tm2Yzo/J) and EIIa promoter 
driven Cre recombinase transgenic mice (Strain 003724, B6.FVB-Tg(EIIa-cre)C5379Lmgd/J) 
were obtained from the Jackson Laboratory and bred together 
to generate Cxcr4+/o mice on a homozygous CD45.2 background. Cxcr4+/+ 
mice were obtained from Jackson Laboratory on the homozygous CD45.1 
and homozygous CD45.2 backgrounds. Creation of WHIM knockin mice 
bearing a heterozygous Cxcr4 S338X mutation has been previously described 
(Balabanian et al., 2012), and sperm from these mice was used for in vitro 
fertilization at NIH to impregnate female C57BL/6 mice from Taconic Farms 
to produce Cxcr4+/S338X mice on a homozygous CD45.2 background. Both 
Cxcr4+/o and Cxcr4+/S338X mice on the homozygous CD45.2 background 
were bred to Cxcr4+/+ mice on a homozygous CD45.1 background to produce 
Cxcr4+/o and Cxcr4+/S338X mice on a heterozygous CD45.1/CD45.2 background. 
In this way, differentially marked donor bone marrow cells could be 
tracked in vivo during competitive transplantation experiments. Five million 
bone marrow cells (~50% from each donor) were transferred via tail vein injec- 
tion into recipient mice that had undergone lethal irradiation (900 rads) 8 hr 
prior to transplant. The mice were fed with neomycin water for 4 weeks 
post-irradiation. All animal experiments were performed using an NIAID Animal 
Care and Use Committee (ACUC) approved protocol in approved and certified 
facilities. 
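
One common way to summarize a competitive transplant of this kind is to compare the odds of a donor population in the harvested marrow with its odds in the injected mix. The sketch below is illustrative only and is not taken from the authors' analysis: the function name and the output chimerism value are hypothetical, and only the 42:58 input ratio comes from the Figure 6B legend.

```python
def competitive_odds_ratio(input_fraction, output_fraction):
    """Odds of the test donor after engraftment divided by its odds in the input mix.

    A value above 1 indicates that the test donor outcompeted the competitor;
    a value of 1 indicates no change relative to the injected ratio.
    """
    for name, value in (("input_fraction", input_fraction),
                        ("output_fraction", output_fraction)):
        if not 0.0 < value < 1.0:
            raise ValueError(f"{name} must lie strictly between 0 and 1")
    input_odds = input_fraction / (1.0 - input_fraction)
    output_odds = output_fraction / (1.0 - output_fraction)
    return output_odds / input_odds


# Input mix from the Figure 6B legend: 42% test donor, 58% competitor.
input_fraction = 0.42

# HYPOTHETICAL output chimerism, chosen only to show the arithmetic.
hypothetical_output_fraction = 0.80

ratio = competitive_odds_ratio(input_fraction, hypothetical_output_fraction)
print(f"odds ratio relative to input: {ratio:.2f}")
```

Normalizing to the input odds matters because the donors were not mixed at exactly 50:50, so raw output percentages alone would understate or overstate the advantage.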

For additional details, see Extended Experimental Procedures. 
SUMMARY 

PTX3 is an essential component of the humoral arm 
of innate immunity, playing a nonredundant role in 
resistance against selected microbes and in the 
regulation of inflammation. PTX3 activates and regulates 
the Complement cascade by interacting with 
C1q and with Factor H. PTX3 deficiency was associated 
with increased susceptibility to mesenchymal 
and epithelial carcinogenesis. Increased susceptibility 
of Ptx3−/− mice was associated with enhanced 
macrophage infiltration, cytokine production, angio- 
genesis, and Trp53 mutations. Correlative evidence, 
gene-targeted mice, and pharmacological blocking 
experiments indicated that PTX3 deficiency resulted 
in amplification of Complement activation, CCL2 pro- 
duction, and tumor-promoting macrophage recruit- 
ment. PTX3 expression was epigenetically regulated 
in selected human tumors (e.g., leiomyosarcomas 
and colorectal cancer) by methylation of the pro- 
moter region and of a putative enhancer. Thus, 
PTX3, an effector molecule belonging to the humoral 
arm of innate immunity, acts as an extrinsic oncosup- 
pressor gene in mouse and man by regulating Com- 
plement-dependent, macrophage-sustained, tumor- 
promoting inflammation. 

INTRODUCTION 

Inflammatory cells and molecules are an essential component of 
the tumor microenvironment (Coussens et al., 2013; Grivennikov 
et al., 2010; Hanahan and Weinberg, 2011; Mantovani et al., 
2008). The connection between inflammation and cancer can 
be viewed as consisting of two pathways. Selected inflammatory 
conditions, the prototype of which is ulcerative colitis, increase 
the risk of developing cancer (extrinsic pathway). On the other 
hand, genetic events that cause cancer orchestrate the construction 
of an inflammatory microenvironment even in tumors that are 
epidemiologically unrelated to inflammation (Mantovani et al., 
2008). Genetic approaches have unequivocally proven the role 
of cells and mediators (e.g., cytokines) of the innate and adaptive 
immune system in sustaining tumor-promoting inflammation. 
However, no dominant or suppressor oncogene encodes for 
effector molecules of the humoral innate immune system. 

The innate immune system includes a cellular and a humoral 
arm (Garlanda et al., 2005). The humoral arm of innate immunity 
is constituted of diverse molecules including Complement 
components, collectins, ficolins, and pentraxins (Bottazzi et al., 
2010). The long pentraxin PTX3 is an essential component of 
the humoral arm of innate immunity (Bottazzi et al., 2010; Garlanda 
et al., 2005). PTX3 acts as a functional ancestor of antibodies: 
it interacts with selected microbial molecules (Jeannin 
et al., 2005); it has opsonic activity via Fcγ receptors (Moalli 
et al., 2010); it activates and regulates the Complement cascade 
by interacting with C1q and Factor H (Bottazzi et al., 1997; Deban 
et al., 2008); and it regulates inflammation by interacting with 
P-selectin via its glycosidic moiety (Deban et al., 2010). PTX3 
plays a nonredundant role in resistance against selected microbial 
pathogens in mouse and man (Chiarini et al., 2010; Cunha et al., 
2014; Garlanda et al., 2002) and is a candidate therapeutic agent 
against Aspergillus fumigatus and Pseudomonas aeruginosa. 

In the context of an effort to translate this molecule to the clinic 
as a novel candidate antimicrobial agent in cancer patients at 
risk of infection (Cunha et al., 2014), it was important to assess 
its role in carcinogenesis. We found that the humoral pattern 
recognition molecule PTX3 acts as an oncosuppressor in mice 
and humans by regulating Complement-dependent tumor-pro- 
moting inflammation. This observation represents a missing 
link in the connection between inflammation and cancer by 
providing genetic evidence that an effector molecule of humoral 
innate immunity can act as a cancer gene. 

RESULTS 

PTX3 Deficiency Increases Susceptibility to Cancer 
Development and Growth 

In order to address the role of PTX3 in cancer-related inflamma- 
tion (CRI) and tumor development, we analyzed the susceptibility 
Figure 1. PTX3 Deficiency Increases Susceptibility to Carcinogenesis 

(A and B) Incidence (A) and tumor volume (B) of 3-MCA-induced sarcoma in Ptx3−/− and Ptx3+/+ mice. In (B), the volume of each tumor is shown. One experiment 
with 20 mice/group out of ten performed with similar results is shown. 

(C) Effect of recombinant PTX3 (9 μg/day, s.c. by osmotic pump) administered from day 0 to 50, 50 to 100, or 0 to 100 on tumor incidence. 

(D and E) Incidence (D) and number of lesions (mean ± SEM) (E) of DMBA/TPA-induced skin papillomas in Ptx3−/− and Ptx3+/+ mice. One experiment with ten 
mice/group out of five performed with similar results is shown. 

(F) Histological classification of lesions developed in Ptx3−/− and Ptx3+/+ mice. **p < 0.01, Chi-square test. Two pooled experiments. 

(A-E) *p < 0.05, **p < 0.01, ***p < 0.001, paired Student’s t test. See also Figure S1. 



of Ptx3−/− mice in different models of mesenchymal or epithelial 
carcinogenesis. As shown in Figure 1A, in the model of 3-Methylcholanthrene 
(3-MCA)-induced carcinogenesis, 80% Ptx3−/− 
mice developed a sarcoma in comparison to 40% Ptx3+/+ mice 
(p = 0.0002). Ptx3−/− sarcomas grew faster than Ptx3+/+ tumors 
and reached the endpoint (3 cm³) in 100-120 days, whereas 
Ptx3+/+ tumors reached it in 170-220 days (Figure 1B). Treatment 
with recombinant PTX3 subcutaneously (s.c.) throughout the 
carcinogenesis process (days 0-100) completely rescued the 
phenotype of Ptx3−/− mice (Figure 1C). Early treatment (days 
0-50) inhibited the early difference, with an increase in incidence 
following interruption of treatment. A minor decrease in tumor 
incidence was also observed when treatment was begun on 
day 50 through day 100 (Figure 1C). The impact of PTX3 administration 
after day 50 may well reflect a role of Complement 
(see below) in promotion of established transplanted tumors 
(Markiewski et al., 2008). Treatment of Ptx3+/+ mice caused 
some delay in tumor appearance but did not affect incidence 
(Figure S1A available online). 

We next addressed the relevance of PTX3 deficiency in a 
model of 7,12-dimethylbenz[a]anthracene/12-O-tetradecanoylphorbol-13-acetate 
(DMBA/TPA)-induced skin carcinogenesis. As shown in Figures 
1D and 1E, both the incidence and the multiplicity of papillomas 
were significantly increased in Ptx3−/− mice in comparison to 
Ptx3+/+ mice (p = 0.0004 and p = 0.0005, respectively). In addition, 
the number of lesions evolving to skin carcinomas was 
significantly higher in Ptx3−/− mice (p = 0.009), suggesting a 
more aggressive behavior of DMBA/TPA-induced skin lesions in 
PTX3-deficient hosts (Figure 1F). 

These data indicate that PTX3 is involved in controlling the 
incidence and growth of tumors of both mesenchymal and 
epithelial origin. 

Production of PTX3 during Carcinogenesis 

The following experiments, aimed at defining the underlying 
cellular and molecular mechanisms, were focused on the 3- 
MCA-induced carcinogenesis model. We first analyzed the pro- 
duction of PTX3 during carcinogenesis. 3-MCA is a carcinogenic 
compound acting as initiator and promoter, metabolized by the 
liver. PTX3 levels were increased in serum (Figure 2A) and liver 
(not shown) a few hours (8 and 24) after 3-MCA treatment, 
whereas after 7 days they had returned to basal levels. At the in- 
jection site, the levels of PTX3 were higher 24 hr after 3-MCA 
treatment in comparison to vehicle and further increased at 
day 7. PTX3 was originally cloned as an IL-1 inducible gene (Gar- 
landa et al., 2005). It was therefore important to assess whether 
IL-1 was upstream of PTX3 in 3-MCA carcinogenesis. Both systemic 
and local PTX3 levels were significantly lower in Il1r1−/− 
mice compared to wild-type mice, suggesting that in this model, 
IL-1 is involved in PTX3 induction (Figure 2A). In established 
Figure 2. Analysis of the Cellular Source of PTX3 and of Its Role in 3-MCA-Induced Carcinogenesis 

(A) Systemic (upper panel) and local (lower panel) PTX3 production at different time points upon vehicle or 3-MCA injection in wild-type and Il1r1−/− mice (mean ± 
SEM). *p < 0.05, **p < 0.01, ***p < 0.001, unpaired Student’s t test. 

(B) Immunohistochemical analysis of PTX3 expression in 3-MCA-induced sarcoma. Arrows show vessels (upper panel), infiltrating leukocytes (central panel), and 
interstitial stroma (lower panel). Arrowheads show tumor cells (upper and central panels). 

(C) Immunofluorescence analysis of PTX3 expression in blood vessels (CD31+ cells), macrophages (F4/80+ cells), and neutrophils (Ly6G+ cells). Scale bar 
represents 150 μm. 

(D) 3-MCA-induced sarcoma incidence in PTX3-chimeric mice (Donor → Recipient). **p < 0.01, paired Student’s t test. 

See also Figure S2. 



tumors (day 100), PTX3 immunostaining was associated with 
vessels, macrophages, neutrophils, and interstitial stroma, but 
not with tumor cells (Figures 2B and 2C). Accordingly, most sarcoma 
cell lines isolated from Ptx3+/+ tumor-bearing mice did 
not produce PTX3 in basal or inflammatory conditions (Figure S2C). 
PTX3 expression in cells of the monocyte-macrophage 
lineage was confirmed by RT-PCR analysis, with highest levels in 
MHCIIlow macrophages (see below) (Figure S2D). At early time 
points (e.g., day 15), PTX3 expression was strongly induced in 
F4/80+ cells (macrophages) in the skin and to a lesser extent in 
liver, as well as in stromal cells (Figure S2A). In the latter cell 
type, substantial coexpression of a marker of DNA damage 
response (DDR), γ-H2Ax, was observed only at early time points 
and waned thereafter (Figure S2B). Collectively, these results suggest 
that infiltrating leukocytes and endothelial cells are a major 
source of PTX3 and that silencing of PTX3 expression in tumor 
cells occurs generally during progression. 

Bone marrow chimeras were used to assess the relative 
importance of hematopoietic and nonhematopoietic (including 
tumor) cells as a source of PTX3 and in the protection against 
Figure 3. PTX3 Deficiency Is Associated to Increased CRI 

(A) Histological and immunohistochemical analysis of macrophage (CD68+) and neutrophil (Ly6G+) infiltration in Ptx3−/− and Ptx3+/+ 3-MCA-injection site at 
7 days. 

(B) MPO levels in 3-MCA-injection site at 7 days (mean ± SEM). See also Table S1. 

(C) Analysis of the leukocyte infiltrate in 3-MCA-sarcoma by FACS. Total leukocytes (CD45+), macrophages (F4/80+/CD11b+), monocytes (CD11b+/Ly6C+), 
neutrophils (CD11b+/Ly6G+), and lymphocytes (CD3+) were analyzed (mean ± SEM). 

carcinogenesis. PTX3 tumor concentrations in the two chimeric 
groups (Ptx3+/+ mice receiving Ptx3−/− bone marrow and 
Ptx3−/− mice receiving Ptx3+/+ bone marrow) were comparable 
(2.15 ng/mg ± 0.8 ng/mg and 1.89 ng/mg ± 0.1 ng/mg) and significantly 
lower than in Ptx3+/+ tumors (6.61 ng/mg ± 0.2 ng/mg, 
p < 0.003), suggesting that both nonhematopoietic cells and hematopoietic 
cells produce PTX3 in tumors. PTX3 competence in 
hematopoietic cells or in stromal cells alone was insufficient to reconstitute 
the full “protected” phenotype observed in Ptx3+/+ mice 
or in reconstituted fully competent mice (Figure 2D). Thus, 
PTX3 derived from both hematopoietic and nonhematopoietic 
cells contributes to protection against carcinogenesis. 

PTX3 Deficiency Is Associated with Increased 
Cancer-Related Inflammation 

PTX3 does not affect the proliferation of normal and transformed 
cells (Figure S1B). It interacts with Fcγ receptors (Lu et al., 2008; 
Moalli et al., 2010) and has antibody-like properties (Bottazzi 
et al., 2010) including the capacity to regulate inflammatory reactions. 
Therefore, we focused on CRI in Ptx3−/− mice. We first 
evaluated leukocyte recruitment into the 3-MCA-injection site 
at early time points in Ptx3+/+ and Ptx3−/− mice. As shown in Figure 3A, 
on day 7 after injection macrophages (CD68+ cells) and 
neutrophils (Ly6G+ cells) recruited around 3-MCA-containing 
corn oil droplets were more abundant in Ptx3−/− compared to 
Ptx3+/+ mice. In agreement, the MPO content (Figure 3B), as 
well as the levels of CCL2 and CXCL2 (Table S1), were significantly 
higher in homogenates of tissues collected from Ptx3−/− 
mice, compared to Ptx3+/+ mice. 

We next analyzed the leukocyte infiltrate of tumors collected at 
their maximum volume (3 cm³) by flow cytometry. As shown in 
Figure 3C, the percentage of macrophages (F4/80+/CD11b+ 
cells) and monocytes (Ly6Chigh/CD11b+ cells) in CD45+ cells 
was higher in Ptx3−/− compared to Ptx3+/+ mice, whereas the 
percentage of neutrophils (Ly6Ghigh/CD11b+ cells) was significantly 
lower. The percentage of T lymphocytes (CD3+ cells) 
was comparable in the two groups. Similar results were obtained 
when the analysis was performed at day 100 after 3-MCA treatment 
in tumors of different size in Ptx3−/− and Ptx3+/+ mice (Figure S3A). 
Histological analysis confirmed increased macrophage 
infiltration in Ptx3−/− sarcomas and showed higher density of 
CD31+ vessels (Figures 3D and 3E). Similar lymphocyte infiltration 
in the two groups was confirmed by histological analysis 
(Figure 3F). 

In addition, the concentrations of proinflammatory cytokines 
TNFα, IL-1β, and IL-6, CCL2, and the proangiogenic vascular 
endothelial growth factor (VEGF) were significantly higher in 
Ptx3−/− tumor homogenates than in Ptx3+/+. IL-6 and VEGF 
levels were significantly higher also at the systemic level in 
Ptx3−/− mice (Table S1). Treatment with recombinant PTX3 
reduced macrophage infiltration and angiogenesis, abolishing 
the difference between the two genotypes (Figures 3G and S3B). 



PTX3 Deficiency Is Associated to Increased 
Complement Activation 

PTX3 has been shown to tune inflammatory responses by modulating 
P-selectin-dependent leukocyte recruitment and Complement 
activation (Deban et al., 2010; Inforzato et al., 2013). We 
first addressed the role of PTX3-P-selectin interaction in leukocyte 
recruitment and in tumor growth by evaluating sarcoma 
incidence in P-selectin-deficient and in PTX3/P-selectin-double-deficient 
mice. As shown in Figure S3C, we found that 
PTX3 deficiency was associated with higher tumor incidence 
regardless of P-selectin competence. Moreover, P-selectin deficiency 
did not affect susceptibility to 3-MCA carcinogenesis. 

Next, we analyzed Complement activation. Confocal microscopy 
of the injection site revealed significantly higher C3 immunoreactivity 
in areas surrounding 3-MCA-containing oil droplets 
in Ptx3−/− mice, in comparison to Ptx3+/+ mice, where C3 deposition 
was negligible (Figure 4A). In tumors, C3 deposition 
occurred also in Ptx3+/+ lesions, but it was significantly lower 
than in Ptx3−/− sarcomas (Figure 4B, two pooled experiments). 
An analysis of C3 deposition and PTX3 expression in each 
Ptx3+/+ lesion was performed on samples from one experiment 
and showed that C3 and PTX3 were significantly inversely correlated 
(Pearson r = -0.65, p = 0.03). Treatment with recombinant 
PTX3 reduced C3 immunoreactivity in Ptx3−/− tumors to levels 
observed in Ptx3+/+ tumors (Figure 4C). 
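
For readers less familiar with the statistic reported above, the Pearson coefficient is the covariance of two measurements divided by the product of their standard deviations, so a negative value means one quantity tends to fall as the other rises. The sketch below is a minimal illustration, not the authors' analysis: the per-lesion values are invented placeholders, and the variable names are hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return sxy / math.sqrt(sxx * syy)


# HYPOTHETICAL per-lesion values (arbitrary units), invented for illustration only.
ptx3_expression = [1.0, 2.0, 3.0, 4.0, 5.0]
c3_deposition = [9.0, 8.5, 6.0, 6.5, 3.0]

r = pearson_r(ptx3_expression, c3_deposition)
print(f"Pearson r = {r:.2f}")  # negative: C3 falls as PTX3 rises in this toy data
```

The coefficient measures linear association only and says nothing about causation, which is why the text pairs it with the rescue experiment using recombinant PTX3.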

In addition, a higher C5a concentration was measured in 
Ptx3−/− tumor homogenates compared to Ptx3+/+ ones (Figure 4D). 
In contrast, the immunoreactivity for C5b-9, which was 
associated with necrotic tumor areas, did not differ in Ptx3−/− 
and Ptx3+/+ tumors (Figure 4E). We next analyzed the immunoreactivity 
for Factor H and C4BP, because PTX3 interacts with 
these two Complement-regulatory proteins without interfering 
with their functional activity and facilitates their deposition on 
damaged cells, amplifying their regulatory potential (Braunschweig 
and Jozsi, 2011; Deban et al., 2008). As shown in Figure 4F, 
the immunostaining for Factor H in tumors from Ptx3−/− mice 
was significantly lower than in Ptx3+/+ tumors. A correlation analysis 
showed that PTX3 expression and Factor H deposition were 
significantly positively correlated (Pearson r = 0.71, p = 0.01). We 
also observed lower C4BP deposition in Ptx3−/− than in Ptx3+/+ 
tumors, but the difference did not reach statistical significance 
(p = 0.07) (Figure S4A). 

Role of Complement 

In an effort to assess the pathogenic role of increased Complement 
activation associated with PTX3 deficiency, we evaluated the 
susceptibility to 3-MCA of C3-deficient and PTX3/C3-double-deficient 
mice. Carcinogenesis by 3-MCA and macrophage 
recruitment in C3−/− mice were significantly lower than in wild-type 
animals (Figures 4G and 4H). In addition, C3 deficiency 
reduced the susceptibility to 3-MCA of Ptx3−/− mice to levels 
comparable to that of Ptx3+/+ mice and double deficiency was 



(D-F) Analysis of macrophages (F4/80+ immunoreactive area, IRA), angiogenesis (CD31+ IRA), and lymphocytes (CD3+ cells) in 3-MCA-sarcoma by immunohistochemistry 
(each dot represents the average of ten fields, mean ± SEM). 

(G) Analysis by FACS of the macrophage infiltrate (mean ± SEM) in 3-MCA-sarcoma of Ptx3−/− and Ptx3+/+ mice, after treatment with recombinant PTX3 (from day 
0 to 100). *p < 0.05, **p < 0.01, unpaired Student’s t test. 

See also Figure S3. 
associated with macrophage recruitment comparable to that of 
C3−/− mice. Finally, treatment with the C5a receptor antagonist 
PMX53 reduced the susceptibility of Ptx3−/− mice, reverting the 
phenotype, without affecting the tumor growth in wild-type mice 
(Figure 4I). 

Factor H-deficient mice are depleted in circulating C3 and suffer 
from spontaneous nephropathy (Pickering et al., 2002). 
Therefore, a genetic double deficiency approach would not be 
informative as to the actual role of Factor H. Thus, we set up 
an in vitro model of C3 deposition on Ptx3+/+ and Ptx3−/− cell 
lines, in the presence of recombinant PTX3 and a recombinant 
Factor H inhibitor (SRC19-20) (Banda et al., 2013). For these experiments, 
we selected one wild-type sarcoma line (Wt-2), 
generated on day 100, which, unlike most wild-type cell lines 
(Figure S2C), produces detectable levels of PTX3 (8 ng/ml in 
24 hr). In line with results obtained in vivo (Figure 4B), C3 deposition 
was significantly higher on Ptx3−/− cells than on PTX3-producing 
Ptx3+/+ cells (Figures 4J and 4K). Recombinant PTX3 reduced C3 
deposition on Ptx3−/− cells, abrogating the difference between 
Ptx3+/+ and Ptx3−/− cells. SRC19-20, which interferes with 
the interaction between PTX3 and Factor H (Deban et al., 
2008), caused higher C3 deposition on Ptx3+/+ cells and did 
not further increase C3 deposition on Ptx3−/− cells. Notably, 
the rescue of C3 deposition on Ptx3−/− cells obtained with recombinant 
PTX3 was abolished in the presence of SRC19-20. 

To further address whether PTX3 regulates immune complexes 
(IC)-dependent Complement and leukocyte activation, 
we measured circulating IC and analyzed their composition 
and proinflammatory activity. PTX3 deficiency was associated 
with normal levels of total IC and C3-containing IC (Figure S4C). 
Qualitative analysis revealed the presence of PTX3 in IC, but the 
amount of C3, C1q, and total Ig did not differ between Ptx3+/+ 
and Ptx3−/− IC (Figure S4D). Finally, Ptx3+/+ and Ptx3−/− IC 
equally activated macrophages (expression of polarization 
markers and cytokines), indicating that PTX3 does not regulate 
CRI through IC in this cancer model (Figure S4E). 

Altogether, these data suggest that Complement is an essential 
component of tumor-promoting inflammation and that the 
increased Complement activation associated with PTX3 deficiency, 
due to lack of recruitment of Factor H, has a major role in sustaining 
an exacerbated inflammatory response and enhanced 
carcinogenesis. 

Other Components of Cancer-Related Inflammation 

As shown in Figure 3C and Table S1, PTX3 deficiency was associated 
with increased macrophage infiltration and increased 
levels of the monocyte attracting chemokine CCL2. Complement 
activation and C5a in particular induce chemokine production. It 
was therefore important to assess the role of CCL2 in the 
increased susceptibility to carcinogenesis exhibited by PTX3-deficient 
mice. To this aim, we treated Ptx3−/− and Ptx3+/+ 
mice with an anti-CCL2 Ab. As shown in Figure 5A, the anti-CCL2 
treatment did not modify the tumor incidence of Ptx3+/+ 
mice, whereas it reduced the susceptibility of Ptx3−/− mice, 
completely reverting the phenotype. 

CCL2 has been shown to skew macrophage polarization in an 
M2-like direction (Roca et al., 2009; Sierra-Filardi et al., 2014). We 
therefore characterized the phenotype of tumor-associated 
macrophages (TAM) in Ptx3−/− and Ptx3+/+ mice. Upon purification 
of CD11b+/Ly6G− cells, MHCIIhigh (CD11b+/Ly6C−/MHCIIhigh) 
and MHCIIlow (CD11b+/Ly6C−/MHCIIlow) macrophages 
and monocytes (CD11b+/Ly6C+/MHCII−) were sorted 
to >98% purity to analyze the expression of M1- and M2-like 
markers as described (Laoui et al., 2014). As shown in Figures 
5B-5D and Table S2, Ptx3−/− MHCIIhigh and MHCIIlow macrophages 
and monocytes expressed higher levels of M2-like 
markers (in particular Arg1, Ym1, Fizz1, Il10, and Mcr1 in macrophages 
and Ym1 and Fizz1 in monocytes) and lower levels of M1 
markers (in particular Il12a and Ifng) in comparison to Ptx3+/+ 
macrophages and monocytes. Moreover, Ptx3−/− macrophages 
(MHCIIhigh or MHCIIlow) expressed higher levels of Nos2, which is 
overexpressed by TAMs (Murray et al., 2014). 

Increased Trp53 Mutations and DNA Damage in 
Ptx3−/− Mice 

Because CRI is potentially a cause of gene instability (Colotta 
et al., 2009), we next addressed whether the increased tumor incidence 
observed in Ptx3−/− mice was associated with increased 
gene mutation. To this aim, we evaluated the frequency of mutations 
in Trp53 and Kras, which are target genes mutated by 3-MCA, 
in sarcoma cell lines isolated from Ptx3−/− and Ptx3+/+ tumor-bearing 
mice. The genomic DNA of 10 Ptx3+/+ and 12 Ptx3−/− 
3-MCA-induced sarcomas was examined by direct sequencing of 
exons 5 to 8 of the Trp53 gene and of exons 1 and 2 of the Kras gene. 
In line with previous studies (Watanabe et al., 1999), eight out of 
ten wild-type sarcomas (80%) carried mutations of the Trp53 gene 
in 12 different codons, whereas Kras mutations were less frequent 
(20%) and were restricted to two codons (Table S3). As shown in 
Figure 6A, the number of Trp53 mutations was significantly higher 
in Ptx3−/− than in Ptx3+/+ sarcoma cell lines, whereas no differences 
were observed in Kras gene mutations (Figure 6B). Interestingly, 
similar numbers of mutations were observed in Ptx3−/− cell 
lines collected 100 days after treatment or at the end point (tumor 
volume 3 cm³), suggesting that the increased Trp53 gene instability 
is not due to accumulation of mutations during tumor 
progression (Table S3). We finally addressed the functional consequences 
of Trp53 mutations. As shown in Figure 6C, the expression 
of Mdm2, Bax, and Cdkn1a (p21), which are downstream of p53 activation, 
was significantly reduced in Ptx3−/− tumor cell lines in comparison 
to Ptx3+/+, indicating that the higher incidence of Trp53 mutations 
observed in Ptx3−/− is likely associated with increased loss of 
onco-suppressor function. 

To further investigate whether and to what extent the 
increased inflammation in Ptx3−/− mice translated into DNA 
damage, we performed immunohistochemical 
analysis of 8-OH-deoxyguanosine (8-OH-dG), a modified 
DNA base generated during oxidative damage. As shown 
in Figure 6D, the number of 8-OH-dG-positive nuclei was significantly 
higher in Ptx3−/− than in Ptx3+/+ tissues 15 days after 3-MCA 
injection. In addition, the expression of the DDR markers 
γ-H2Ax and 53BP1 was significantly higher in Ptx3−/− tissues 
(Figures 6E and 6F). The relevance of damage caused by reactive 
species was addressed by treating 3-MCA-injected mice 
with aminoguanidine hemisulfate (AG), an irreversible inhibitor 
of iNOS. AG significantly reduced the tumor incidence in 
Ptx3−/− mice (Figure 6G). In addition, treatment with AG 
Figure 4. PTX3 Deficiency Is Associated to Increased Complement Deposition in 3-MCA-Sarcoma 

(A-C) Analysis by confocal microscopy of C3 deposition in the injection site (*) at 7 days (A) or in tumors (B and C) of Ptx3−/− and Ptx3+/+ mice, after treatment with 
recombinant PTX3 (from day 0 to 100) (C). Correlation analysis between C3 and PTX3 expression was performed in wild-type tumors (B). 

(D) Analysis by ELISA of C5a concentrations (mean ± SEM, two pooled experiments) in 3-MCA-tumors. 

(E and F) Analysis by confocal microscopy of C5b-9 (E) and Factor H (F) in 3-MCA-tumors. Correlation analysis between Factor H and PTX3 expression was 
performed in wild-type tumors. 

(G) Incidence of 3-MCA-sarcoma in C3−/− and/or Ptx3−/− and Ptx3+/+ mice (n = 10-15). 

(H) Analysis of the macrophage (F4/80+/CD11b+) infiltrate in 3-MCA-sarcoma by FACS in C3−/− and/or Ptx3−/− and Ptx3+/+ mice. 

(I) 3-MCA-sarcoma incidence in Ptx3−/− and Ptx3+/+ mice treated with the C5aR antagonist PMX53 (n = 7). 



Figure 5. Role of CCL2 in Carcinogenesis and TAM Polarization in PTX3-Deficient Mice

(A) Incidence of 3-MCA-sarcoma in Ptx3-/- and Ptx3+/+ mice treated with anti-CCL2 or irrelevant mAb (n = 9-11) (100 μg/mouse three times/week) for the
duration of the experiment. ***p < 0.001, paired Student’s t test.

(B-D) Analysis by RT-PCR of selected M1 and M2 markers of macrophages (B and C) and monocytes (D) infiltrating 3-MCA-sarcoma in Ptx3-/- and Ptx3+/+ mice
(mean ± SEM). *p < 0.05, **p < 0.01, ***p < 0.001, unpaired Student’s t test. In (B), (C), and (D), data were relative to 18S expression and normalized versus the
mean of wild-type and expressed as mean ± SEM.

See also Table S2. 
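
The normalization described in this legend (expression relative to 18S, then scaled to the wild-type mean) can be illustrated with a minimal sketch. The legend does not state the quantification formula, so the comparative Ct form used here (2 to the power of minus delta Ct, which presumes equal amplification efficiency) is an assumption, and every Ct value and function name below is hypothetical and chosen only for illustration.

```python
from statistics import mean

def relative_expression(ct_target, ct_reference):
    """Target abundance relative to the reference gene (assumed 2^-dCt form)."""
    return 2.0 ** -(ct_target - ct_reference)

def normalize_to_wild_type(wild_type, knockout):
    """Scale both groups so that the wild-type mean equals 1."""
    wt_mean = mean(wild_type)
    return [v / wt_mean for v in wild_type], [v / wt_mean for v in knockout]

# Hypothetical (target Ct, 18S Ct) pairs; not data from the study.
wt_ct = [(25.0, 10.0), (26.0, 10.0)]
ko_ct = [(24.0, 10.0), (23.0, 10.0)]

wt_rel = [relative_expression(t, r) for t, r in wt_ct]
ko_rel = [relative_expression(t, r) for t, r in ko_ct]
wt_norm, ko_norm = normalize_to_wild_type(wt_rel, ko_rel)

print("wild-type:", wt_norm)
print("knockout: ", ko_norm)
```

With this scaling the wild-type group averages 1 by construction, so each knockout value reads directly as a fold change over wild-type.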



abolished the difference in 8-OH-dG and γ-H2AX immunoreactivity
between Ptx3-/- and Ptx3+/+ tissues (Figures 6D and 6E).
These results suggest that increased CRI in PTX3-deficient
mice is associated with increased genetic instability as assessed
by the frequency of Trp53 mutations, oxidative DNA damage,
and expression of DDR markers.

Methylation of the PTX3 Gene in Human Cancer 

We next investigated whether results obtained in the mouse are
indeed relevant to human cancer. Studies in human esophageal
squamous cell carcinoma had shown that the PTX3 promoter is
hypermethylated in this cancer and PTX3 expression is inhibited
(Wang et al., 2011). In addition, in silico bioinformatics analysis
in the Epigenomics database (http://www.ncbi.nlm.nih.gov/epigenomics)
showed PTX3 gene methylation in colorectal cancer
(CRC) (File: GSM801957). We thus analyzed the methylation
status of the promoter and of a CpG island located in a putative
enhancer encompassing the second exon (Akhtar-Zaidi et al.,
2012) in a series of mesenchymal and epithelial cancers. As
shown in Figure 7A, both the promoter and the CpG island are



(J) Immunofluorescence analysis of C3 deposition on Ptx3+/+ and Ptx3-/- cells, in the presence of recombinant PTX3, SRC19-20, or their combination. Arrows
indicate faint C3 immunostaining in Ptx3+/+ cells in basal conditions or after addition of PTX3 in the assay.

(K) Quantification of C3 deposition evaluated by immunofluorescence. *p < 0.05, **p < 0.01, ***p < 0.001, unpaired Student’s t test (A-F, H, and K), paired
Student’s t test (G and I). For (A), (B), (C), (E), (F), and (K), each dot represents the average of mean fluorescence intensity of ten regions of interest (ROI) per tumor
(A-C, E, and F) or well (K) (mean ± SEM). Scale bars represent 150 μm.

See also Figure S4. 
Figure 6. PTX3 Deficiency Is Associated with Increased Genetic Instability

(A and B) Number of Trp53 (A) and Kras (B) mutations (mean ± SEM) in cell lines isolated from Ptx3-/- and Ptx3+/+ tumors. See also Table S3.

(C) Analysis of the expression of p53-regulated genes by RT-PCR in cell lines isolated from Ptx3-/- and Ptx3+/+ tumors. Data were relative to GAPDH expression
and represented as mean ± SEM.

(D) Immunohistochemical analysis of 8-OH-dG 15 days after 3-MCA injection in Ptx3+/+ and Ptx3-/- mice treated with AG. Left: data are reported as mean number
± SEM of 8-OH-dG+ nuclei per field. Right: representative immunofluorescence images are shown.

(E and F) Immunofluorescence analysis of γ-H2AX (E) and 53BP1 (F) 15 days after 3-MCA injection in Ptx3+/+ and Ptx3-/- mice treated with AG (E). Left panels:
data are reported as MFI of γ-H2AX (E) and 53BP1 (F) positive foci per field and expressed as mean ± SEM. Right panels: representative immunofluorescence
images are shown.

(G) Sarcoma incidence in Ptx3+/+ and Ptx3-/- mice treated with AG in the drinking water for the entire duration of the experiment. *p < 0.05, **p < 0.01, ***p < 0.001,
unpaired (A-F) or paired (G) Student’s t test.



highly methylated in leiomyosarcomas, desmoid tumors, CRC,
and skin squamous cell carcinoma, in contrast with normal 
mesenchymal or epithelial tissues. We further investigated 
PTX3 methylation and the functional consequences in CRC. 



We analyzed PTX3 methylation in five patients with high-grade
adenomas and 40 patients with microsatellite-stable CRC at
different stages (I-IV). As shown in Figure 7B, the methylation of
the promoter region and of the CpG island progressively
increased from normal colon epithelium to adenomas and to
CRC, independently of stage. The healthy tissue from CRC
patients was more methylated than normal colon epithelium.
Similar results were obtained in colorectal tumor cell lines (Figures
7C and S5A), in which treatment with the methylation inhibitor
5-Aza-2'-deoxycytidine (5-AZA-dC) significantly reduced the
methylation of both the promoter and the CpG island. In the presence
of 5-AZA-dC, the expression of PTX3 mRNA was significantly
increased in five different cell lines analyzed, both under
basal conditions and upon stimulation with TNFα or IL-1β (Figures
7D and S5B). Rescue of PTX3 protein expression was
confirmed in three different cell lines (Figures 7D and S5C). In contrast,
treatment with 5-AZA-dC did not increase PTX3 expression
in normal endothelial or normal colon epithelial cells stimulated
with TNFα (Figures S5D and S5E). In addition, chromatin
immunoprecipitation (ChIP) analysis showed that histone modifications
associated with transcriptional activation (H3K4me3,
H3K27ac, and H3K9ac) were strongly increased in the PTX3
promoter after treatment with 5-AZA-dC and TNFα, whereas
trimethylation of H3K27, associated with gene repression, was not
modified (Figure S5F). Finally, the binding of NF-κB, c-Jun, and
c-Fos to their consensus binding sites in the promoter and
transcription factor binding sites was above the basal level only
after treatment of CRC cells with 5-AZA-dC in association with a
proinflammatory stimulus (Figure 7E). Collectively, these
data suggest that the PTX3 gene is silenced by methylation in
selected human tumors including CRC.

DISCUSSION 

The general objective of this investigation was to gain insight into 
the role of the humoral arm of innate immunity in CRI using the 
long pentraxin PTX3 as a paradigm (Bottazzi et al., 2010). We 
observed increased susceptibility to mesenchymal (3-MCA) 
and epithelial (DMBA/TPA) carcinogenesis in PTX3-deficient 
mice. The 3-MCA model was used for further analysis. This 
phenotype was associated with increased macrophage infiltra- 
tion, cytokine production, angiogenesis, and genetic instability 
as revealed by an increased frequency of Trp53 mutations, 
oxidative DNA damage, and expression of DDR markers in 
Ptx3-/- mice. Several lines of evidence suggest that PTX3 deficiency
unleashes unrestrained Complement activation with production
of C5a, CCL2 production, and enhanced recruitment of
tumor-promoting macrophages. In selected human tumors (e.g., 
leiomyosarcomas and CRC) PTX3 expression was regulated 
epigenetically by methylation of the promoter region and of a putative
enhancer. Thus, an essential component of the humoral
arm of innate immunity and regulator of Complement activation
functions as an extrinsic oncosuppressor by acting at the level
of Complement-mediated, macrophage-sustained, tumor-promoting
inflammation.

PTX3 is a humoral pattern recognition molecule essential 
for resistance against selected microorganisms including 
A. fumigatus, P. aeruginosa (Garlanda et al., 2002), Klebsiella 
pneumoniae (Soares et al., 2006), and uropathogenic Escherichia
coli (Jaillon et al., 2014). PTX3 is highly conserved in evolution 
and genetic evidence is consistent with a role of PTX3 in antimicrobial
resistance in humans (Chiarini et al., 2010; Cunha et al.,
2014; Olesen et al., 2007). PTX3 has antibody-like properties,
including recognition of microbial moieties (Jeannin et al.,
2005), opsonization via Fcγ receptors (Moalli et al., 2010), Complement
activation and regulation (Deban et al., 2008; Inforzato
et al., 2013), and regulation of inflammation (Deban et al., 2010;
Lech et al., 2013; Norata et al., 2009; Salio et al., 2008; Soares
et al., 2006). Given the fact that tumor-promoting inflammation
is now recognized as an essential component of the tumor microenvironment
(Coussens et al., 2013; Grivennikov et al., 2010; Hanahan
and Weinberg, 2011; Mantovani et al., 2008), the
increased susceptibility of PTX3-deficient mice to carcinogenesis
is a reflection of its regulatory function on inflammation.

PTX3 binds P-selectin via its glycosidic component (Deban 
et al., 2010) and this accounts for the regulatory function on 
inflammation of PTX3 in selected P-selectin-dependent experi- 
mental models (Lech et al., 2013). It was therefore important to 
assess the relevance of the P-selectin pathway in the enhanced 
carcinogenesis observed in Ptx3~'~ mice. P-selectin-deficient 
mice show no phenotype in terms of 3-MCA carcinogenesis. 
Moreover, PTX3/P-selectin-double-deficient mice showed 
enhanced 3-MCA carcinogenesis compared to controls. Thus, 
the P-selectin pathway is not involved in the enhanced suscep- 
tibility to carcinogenesis of PTX3-deficient mice. 

PTX3 interacts with Complement at multiple levels (Inforzato
et al., 2013). It binds C1q (Bottazzi et al., 1997) and activates
or regulates (Nauta et al., 2003) the classical pathway of Complement
activation. By interacting with ficolins and MBL it independently
impinges upon the alternative pathway (Gout et al., 2011;
Ma et al., 2009). PTX3 interacts with and recruits Factor H by binding
domains 19-20 and 7 through its N-terminal and glycosylated
C-terminal domains, respectively, without interfering with the capacity
of Factor H to negatively regulate the Complement cascade (Deban
et al., 2008). We found an inverse relationship between C3
deposition and PTX3 presence in individual tumors. Conversely, 
the occurrence of Factor H was directly and significantly corre- 
lated with PTX3 deposition. In an effort to assess the actual sig- 
nificance of the Complement cascade in the onco-suppressive 
function of PTX3, double C3/PTX3-deficient mice were gener- 
ated. Genetic inactivation of C3 strongly reduced, but did not 
completely ablate, the tumor-promoting function of PTX3 defi- 
ciency, suggesting that other mechanisms in addition to un- 
leashed complement may contribute to increased CRI and sus- 
ceptibility to cancer in PTX3-deficient mice. 

A genetic approach to address the role of Factor H in vivo us- 
ing double knockout mice is hindered by systemic C3 consump- 
tion and associated pathology in Factor H-deficient mice (Banda 
et al., 2013; Pickering et al., 2002). However, in an in vitro assay
of C3 deposition on PTX3-competent and PTX3-incompetent cell
lines, a recombinant Factor H inhibitor (SRC19-20) (Banda et al.,
2013), which interferes with the binding between PTX3 and Factor
H (Deban et al., 2008), increased C3 deposition on PTX3-producing
cells and abolished the rescue of C3 deposition on
Ptx3-/- cells obtained with recombinant PTX3. Thus, we infer
that PTX3 deficiency unleashes Complement-dependent, C5a-mediated
tumor-promoting inflammation because of defective
recruitment of the negative regulator Factor H.

Complement is not considered a canonical component of tu- 
mor-promoting inflammation (Coussens et al., 2013; Hanahan 
Figure 7. The PTX3 Gene Is Epigenetically Modified in Human Cancers

(A) Analysis by methylated CpG island recovery assay (MIRA) of the percentage of methylation enrichment of the promoter region and of the PTX3 CpG island in
human mesenchymal and epithelial tumors. CpG rich regions (iCGs) (P1, P2, transcription factor binding site [TFBS], and CpG island [iCG1-9]) and surrounding

and Weinberg, 2011; Mantovani et al., 2008). Complement can
represent an effector mechanism against hematopoietic tumors 
activated by monoclonal antibodies (Golay et al., 2001). The gen- 
eration of Complement C5a in a transplanted tumor enhanced 
growth by recruiting myeloid-derived suppressor cells and 
amplifying their T cell-directed suppressive abilities (Markiewski 
et al., 2008). However, no information based on genetic ap- 
proaches concerning the role of Complement components in 
carcinogenesis is available. Here, we found that C3-deficient 
mice are strongly protected against 3-MCA (Figure 4G) and 
DMBA/TPA (Figure S4B) carcinogenesis, involving mesen- 
chymal and epithelial cells, respectively. These results demon- 
strate that Complement is an essential component of tumor-pro- 
moting inflammation. 

The Complement cascade can be activated by antibodies or 
by sensing tissue damage (Ricklin et al., 2010). Antibodies 
have been shown to orchestrate cancer-related inflammation 
in a model of epithelial carcinogenesis (HPV16 transgenic 
mice) by a remote control mechanism (Andreu et al., 2010). 
However, in that system the pathway driving tumor-promoting 
inflammation was Fcγ receptor-dependent and Complement-independent
(Andreu et al., 2010; de Visser et al., 2004). It is
therefore tempting to speculate that an antibody-independent 
pathway (e.g., interaction with senescent or apoptotic cells) is 
responsible for Complement-mediated orchestration of tumor- 
promoting inflammation in the two models of mesenchymal 
(3-MCA) and epithelial (DMBA/TPA) carcinogenesis used in the 
present study. Interestingly, in the present study, C3-deficient 
mice showed reduced susceptibility to DMBA/TPA skin carcinogenesis
but were not protected against HPV16-driven squamous
carcinoma development (Andreu et al., 2010; de Visser et al.,
2004). CRI can differ considerably in its drivers and components
in tumors originating in different organs and tissues (Ruffell et al.,
2012). Complement is therefore likely to play different roles in
different carcinogenesis contexts.

Macrophages are an essential component of tumor-promot- 
ing inflammation (Qian and Pollard, 2010; Sica and Mantovani, 
2012). Tumors originating in a PTX3-deficient context were characterized
by an increased frequency of TAM throughout the natural
history of the tumor. Increased macrophage accumulation
was completely abolished in C3-deficient or C3/PTX3-double- 
deficient hosts. PTX3 deficiency was associated with increased 
C5a and CCL2. Both molecules are potent monocyte attractants.
Interestingly, at the end point (tumor volume 3 cm³), macrophage,
but not neutrophil, numbers were increased in PTX3-deficient
hosts, consistent with a function of CCL2 in sustaining TAM
recruitment and tumor-promoting inflammation. Macrophages
in tumors usually acquire a skewed M2-like phenotype oriented
to tumor promotion (Sica and Mantovani, 2012). Chemoattractants,
and CCL2 in particular, have been shown to skew macrophage
function in an M2-like direction (Roca et al., 2009). In
PTX3-deficient hosts, TAM showed enhanced expression of
molecules associated with M2-like polarization, including Arg1,
Fizz1, and Ym1. Skewing of macrophage function, with, for
instance, increased immunosuppressive activity, may
contribute to enhanced carcinogenesis in PTX3-deficient hosts.
Antibodies directed against CCL2 reverted the phenotype of 
PTX3-deficient mice in terms of enhanced susceptibility to 3- 
MCA carcinogenesis but did not protect PTX3-competent 
mice, as observed in other murine models and clinical trials (Lo- 
berg et al., 2007; Pienta et al., 2013) suggesting that the treat- 
ment is effective only under conditions of exacerbated CCL2 
production and M2-polarization. Chemoattractants including 
C5a induce chemokines (Sozzani et al., 1995). Thus, uncon- 
trolled Complement activation with C5a production is likely to 
be upstream of CCL2 that in turn recruits tumor-promoting 
TAM in increased numbers, with skewed M2-like phenotype, in 
PTX3-deficient hosts. However, the strong reduction of TAM in 
PTX3/C3-double-deficient mice was not sufficient to completely 
rescue the susceptibility to carcinogenesis of Ptx3-/- mice, suggesting
the involvement of other cell types or mechanisms in
addition to increased TAM in the observed phenotype. 

Inflammatory mediators have been suggested to contribute to 
genetic events that cause cancer and to the genetic instability of 
tumors, although there is little formal evidence for that (Colotta 
et al., 2009). PTX3 is an extracellular molecule whose only 
cellular receptors identified so far are Fcγ receptors (Lu et al.,
2008; Moalli et al., 2010). PTX3 does not affect the growth and 
death of normal or neoplastic cells. PTX3-deficient mice showed 
increased susceptibility to carcinogenesis mediated by Comple- 
ment-dependent tumor-promoting inflammation. Tumors devel- 
oped in a PTX3-deficient context characterized by exacerbated 
inflammation had a higher frequency of Trp53 mutations, in- 
creased DNA oxidative damage and higher expression of DDR 
markers (Figure 6). These findings indicate that exacerbated 
inflammation in the tumor microenvironment can indeed 
contribute to the genetic instability of cancer. 

In an effort to assess the actual significance of the present 
findings to human cancer, supported by a bioinformatics anal- 
ysis showing methylation of the PTX3 gene in CRC, we searched 



regions analyzed by MIRA are shown below the structure of the PTX3 gene. Healthy epithelial tissues were from the normal counterpart of colon cancer patients
(n = 3) and skin squamous carcinoma (n = 5); healthy mesenchymal tissues were from leiomyosarcoma (n = 1), angiosarcoma (n = 3), chordoma (n = 3), and
desmoid tumor (n = 2) patients.

(B) Analysis by MIRA of the percentage of methylation enrichment of the same regions in healthy colon epithelium, high-grade adenomas, CRC, and healthy
counterparts for each CRC stage.

(C) Analysis by MIRA of the percentage of methylation enrichment of the same regions in one normal colon epithelial cell line (HCoEpiC) and two CRC cell lines
(RKO and HCT116) treated or not with the inhibitor of DNA methylation 5-AZA-dC.

(D) PTX3 mRNA expression and protein production by RKO and HCT116 cell lines treated or not with 5-AZA-dC in the presence of TNFα (20 ng/ml) or IL-1β
(20 ng/ml). Healthy colon: primary human intestinal epithelial crypt cells isolated from surgical biopsies.

(E) Analysis by ChIP of NF-κB, c-Fos, and c-Jun binding to the promoter, TFBS, and an irrelevant region in RKO cell line treated or not with 5-AZA-dC in the
presence of TNFα (20 ng/ml). Data are expressed as percentage of enrichment relative to input DNA normalized on a positive control and represented as mean ±
SEM. *p < 0.05, **p < 0.01, ***p < 0.001, unpaired Student’s t test.

See also Figure S5. 
for epigenetic modifications of the PTX3 gene in mesenchymal
and epithelial tumors. In selected human tumors (e.g., leiomyo- 
sarcomas, CRC, and skin squamous cell carcinoma), we found 
that the PTX3 gene was hypermethylated at the promoter region 
and in a CpG rich putative enhancer encompassing exon 2. The 
latter is indeed a functional enhancer (M.R., unpublished data). 
PTX3 gene methylation was detected in CRC stages 1-4 as 
well as in adenomas, suggesting that this is an early event in 
the natural history of CRC (Vogelstein et al., 2013). Hypermethy- 
lation at the PTX3 locus was also observed in CRC cell lines, 
where treatment with 5-AZA-dC rescued the PTX3 production 
in response to inflammatory cytokines. ChIP analysis showed
that only after treatment of CRC cells with 5-AZA-dC and an inflammatory
stimulus did histone modifications associated with transcriptional
activation occur and the transcription factors
responsible for PTX3 expression (e.g., NF-κB, c-Jun, c-Fos) interact
with their binding sites in the PTX3 promoter region. Thus,
the PTX3 gene is silenced by hypermethylation in selected human
tumors including CRC, and this event occurs early in progression,
already at the level of adenomas.

The results presented here demonstrate that a regulatory 
component of the humoral arm of innate immunity, PTX3, acts 
as an extrinsic oncosuppressor gene in mouse and man. 
Although the connection between inflammation and cancer has 
long been perceived (Coussens et al., 2013; Grivennikov et al., 
2010; Hanahan and Weinberg, 2011; Mantovani et al., 2008), 
we are not aware of a precedent for a humoral innate immunity 
effector molecule acting as a cancer gene. An additional general 
implication of the results reported here relates to Complement. 
The results obtained suggest that Complement is a key compo- 
nent of tumor-promoting cancer-related inflammation. 

EXPERIMENTAL PROCEDURES 

Cancer Models 

Mice were injected s.c. with a single dose of 100 μg of 3-MCA (Sigma-Aldrich)
dissolved in corn oil and assessed for tumor development over the course of 
5 months. Skin carcinogenesis was performed as reported (Schioppa et al., 
2011) in a two-step DMBA/TPA model. Procedures conformed to protocols 
approved by the Humanitas Clinical and Research Center in compliance 
with national and international law and policies. The study was approved by 
the Italian Ministry of Health. 

Analysis of Complement Deposition 

Deposition of Complement components was analyzed by confocal micro- 
scopy on tumor tissue or cell lines. C3 deposition on tumor cell lines was 
performed by incubating cells with 10% normal mouse serum in gelatin-con- 
taining Veronal buffer, recombinant PTX3, and/or 33 nM Factor H inhibitor pep- 
tide SRC19-20 (Braunschweig and Jozsi, 2011). 

Analysis of DNA Damage 

Genomic DNA was extracted from 10 Ptx3+/+ and 12 Ptx3-/- 3-MCA-induced
sarcoma cell lines, and the direct sequencing of exons 5 to 8 of the Trp53 gene
and of exons 1 and 2 of the Kras gene was performed as previously reported
(Watanabe et al., 1999). DNA damage marker (8-OH-dG) and DDR markers
(γ-H2AX and 53BP1) were analyzed by immunohistochemistry and
immunofluorescence.

Patients 

Paraffin-embedded tissues from oncologic Caucasian patients, who under- 
went resection surgery at Humanitas Clinical and Research Center, were 
examined for PTX3 gene epigenetic modification by methylated CpG island recovery
assay (MIRA). The Institutional Review Board of the Humanitas Clinical
and Research Center approved this study (ICH-99/09).

Detailed procedures for in vivo studies, biochemical assays, FACS analysis, 
immunostaining and confocal analysis, quantitative PCR, statistical analysis, 
and all other methods are described in Extended Experimental Procedures. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, five 
figures, and three tables and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2015.01.004.
SUMMARY

AMP-activated protein kinase (AMPK) is a master 
sensor and regulator of cellular energy status. Upon 
metabolic stress, AMPK suppresses anabolic and 
promotes catabolic processes to regain energy ho- 
meostasis. Cancer cells can occasionally suppress 
the growth-restrictive AMPK pathway by mutation 
of an upstream regulatory kinase. Here, we describe 
a widespread mechanism to suppress AMPK through 
its ubiquitination and degradation by the cancer-spe- 
cific MAGE-A3/6-TRIM28 ubiquitin ligase. MAGE-A3 
and MAGE-A6 are highly similar proteins normally ex- 
pressed only in the male germline but frequently 
re-activated in human cancers. MAGE-A3/6 are 
necessary for cancer cell viability and are sufficient 
to drive tumorigenic properties of non-cancerous 
cells. Screening for targets of MAGE-A3/6-TRIM28 
revealed that it ubiquitinates and degrades AMPKα1.
This leads to inhibition of autophagy, activation of 
mTOR signaling, and hypersensitization to AMPK ag- 
onists, such as metformin. These findings elucidate a 
germline mechanism commonly hijacked in cancer to 
suppress AMPK. 

INTRODUCTION 

Cells must coordinate multiple metabolic processes in order to 
balance their energy usage with nutrient availability. One promi- 
nent way that this balance is accomplished is through the activity 
of the AMP-activated protein kinase (AMPK). AMPK is a heterotrimeric
kinase comprised of catalytic α and regulatory β and γ
subunits that is regulated by the cellular concentrations of
ATP, ADP, and AMP (Hardie et al., 2012b). When cellular levels
of ATP fall and ADP/AMP rise, ATP that is bound to the γ subunit
is replaced by ADP and/or AMP, resulting in activation of the catalytic
kinase subunit (Landgraf et al., 2013; Suter et al., 2006).
Once activated, AMPK generally opposes anabolic energy- 
consuming pathways while promoting catabolic ATP-generating 
pathways. For example, AMPK inhibits ACC1 and mTOR to 
block fatty acid and protein synthesis, respectively, while at 
the same time it promotes autophagy via multiple pathways
involving mTOR, ULK1, and VPS34 (Egan et al., 2011; Gwinn
et al., 2008; Hardie et al., 2012b; Kim et al., 2011, 2013). In addition
to changes in energy levels, upstream kinases such as
LKB1/STK11 and CaMKK regulate AMPK activity by phosphorylation
of its activation loop at T172 (Hawley et al., 2003, 2005;
Shaw et al., 2004; Woods et al., 2005).

Although AMPK may in some cases promote late-stage tumor 
growth (Laderoute et al., 2014), multiple lines of evidence sug- 
gest AMPK has critical tumor suppressor activities in both 
humans and experimental models, including mice (Hardie and 
Alessi, 2013; Shackelford and Shaw, 2009). For example, 
knockout of AMPKα1 in the mouse accelerates development
of c-Myc-driven lymphomas (Faubert et al., 2013). AMPK’s role 
in suppressing tumor initiation and progression is multifaceted, 
including growth suppression by inhibiting synthesis of cellular 
macromolecules (Hardie et al., 2012b), particularly through 
downregulating the mTOR signaling pathway (Gwinn et al., 
2008; Inoki et al., 2003), and promoting cell-cycle arrest through 
stabilizing p53 and cyclin-dependent kinase inhibitors (Imamura 
et al., 2001; Jones et al., 2005; Liang et al., 2007). Additionally, 
AMPK can oppose the Warburg effect in favor of oxidative phos- 
phorylation through upregulating oxidative enzymes and pro- 
moting mitochondrial biogenesis (Canto et al., 2009; Winder 
et al., 2000). Furthermore, AMPK has recently been shown to 
inhibit epithelial-to-mesenchymal transition (EMT) by modulating
the Akt-MDM2-Foxo3 signaling axis (Chou et al., 2014).

Given the importance of metabolic control and AMPK’s role as 
master sensor and regulator of cellular energy, it is not surprising 
that this signaling axis is de-regulated in a variety of disease 
states, including cancer (Hardie and Alessi, 2013; Shackelford 
et al., 2009). For example, in approximately 20% of lung adeno- 
carcinomas and cervical cancers, signaling through this axis is 
reduced by loss-of-function mutation or deletion of Lkb1/Stk11
(Matsumoto et al., 2007; Sanchez-Cespedes et al., 2002; Wingo
et al., 2009). Additionally, AMPK levels have been shown to be
reduced in some cases of hepatocellular carcinomas, and
B-RAF V600E can downregulate AMPK signaling through inhibition
of Lkb1/Stk11 in melanomas (Esteve-Puig et al., 2009; Lee et al.,
2012; Zheng et al., 2009; Zheng et al., 2013). From these multiple
lines of converging evidence on AMPK’s critical role in tumor 
suppression, there is great interest in the utilization of com- 
pounds that stimulate AMPK activity, such as metformin, in the 
prevention and treatment of cancer, and many clinical trials are
ongoing (Hadad et al., 2011; Hardie et al., 2012a; Niraula et al., 
2012; Pernicova and Korbonits, 2014). 

Melanoma antigen (MAGE) genes are conserved in all eukaryotes,
encode proteins with a common MAGE homology
domain, and have rapidly expanded to comprise almost 50
unique genes in humans (Chomez et al., 2001; Feng et al.,
2011). Approximately two-thirds of human MAGEs are considered
cancer-testis antigens because they are normally restricted
to expression in the testis but are aberrantly re-expressed in can- 
cer and have antigenic properties (Simpson et al., 2005). The 
functional significance of MAGEs in tumors is not well under- 
stood, but accumulating evidence supports their importance. 
For example, knockdown of MAGE-A3/6 impairs tumor growth 
in mice, whereas expression of MAGE-A3 in MAGE-negative 
cells drives tumor growth and metastasis in vivo (Liu et al., 
2008; Yang et al., 2007). Importantly, we recently showed that 
a defining characteristic of MAGE proteins is their ability to 
bind and potentiate the activity of specific E3 ubiquitin ligases 
(Doyle et al., 2010). For example, MAGE-L2 binds to the 
TRIM27 ubiquitin ligase and promotes ubiquitination of the 
WASH actin assembly complex to facilitate endosomal protein 
recycling (Hao et al., 2013). 

Here, we present evidence for a regulatory axis engaged in 
cancer cells that downregulates AMPK through ubiquitination 
and degradation of AMPKα1 by the normally testis-restricted
MAGE-A3/6-TRIM28 E3 ubiquitin ligase complex activated in 
cancer. These findings identify a widespread mechanism for 
downregulating AMPK signaling during tumorigenesis and eluci- 
date an unanticipated mechanism of action for oncogenic MAGE 
cancer-testis antigens. 

RESULTS 

MAGE-A3 and MAGE-A6 Are Physiologically Restricted
to Expression in the Testis but Are Aberrantly Expressed 
in Cancer 

MAGE-A3 and MAGE-A6 are highly similar, neighboring genes 
located on the X chromosome that encode proteins with 96% 
identity (Figures S1A and S1B). Given their homology and functional
redundancy (see below), we refer to these genes as simply
MAGE-A3/6 herein. To thoroughly examine the expression 
pattern of MAGE-A3/6, we analyzed their expression by quanti- 
tative RT-PCR (qRT-PCR) in >50 mouse tissues from two strains 
of mice (C57BL/6 and BALB/C). Consistent with previous findings
(De Plaen et al., 1994), mouse MAGE-A3/6 were completely
restricted to expression in the testis with no detectable expression
in any other tissue (Figures 1A, 1B, S1C, and S1D). We
extended these analyses to a panel of >20 human tissues and
found that human MAGE-A3/6 are similarly restricted to expression
only in the human testis (Figure 1C). Like other cancer-testis
antigen genes, MAGE-A3/6 have been reported to be aberrantly 
expressed in tumors (Jang et al., 2001; Shantha Kumara et al., 
2012). Our analysis of a variety of different tumor types from 
patients revealed that MAGE-A3/6 are commonly expressed 
in many cancer types, including breast invasive carcinomas 
(25%), colon adenocarcinomas (50%), and lung squamous cell 
carcinomas (75%; Figures 1D and 1E). Additionally, expression
of MAGE-A3 and MAGE-A6 was significantly correlated in breast 
invasive carcinomas, colon adenocarcinomas, and lung squa- 
mous cell carcinomas (Figures 1F-1H). However, expression of 
MAGE-A3 was not significantly correlated with expression of un- 
related MAGE-A11 or MAGE-B2 (Figures S1E and S1F), sug- 
gesting that MAGE-A3 and MAGE-A6 expression is selectively 
coordinated. Furthermore, we analyzed whether MAGE-A3/6 
expression correlates with patient outcome, as measured by overall 
survival. Indeed, patients with lung squamous cell carcinomas 
expressing MAGE-A3 or MAGE-A6 have a significant decrease in 
overall survival time (Figures 1I and 1J). Patients with tumors 
expressing MAGE-A3 had a >50% reduced overall survival time 
compared to patients with MAGE-A3-negative tumors (30 versus 
69 months, respectively; hazard ratio of 2.0; Table S1). Similarly, 
patients with tumors expressing MAGE-A6 had a >50% reduced 
survival time (33 versus 71 months; hazard ratio of 1.9; Table S1). 
Together, these results suggest MAGE-A3/6 are physiologically 
restricted to the testis in both humans and mice but are 
frequently found in a wide variety of cancer types, and their 
expression correlates with poor patient prognosis. 
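
The ">50% reduced" survival figures follow from the overall survival times quoted above. A minimal arithmetic check is sketched below in Python; the function name is illustrative and is not part of the study's analysis, and only the month values reported in the text are used.

```python
def percent_reduction(expressing_months: float, negative_months: float) -> float:
    """Percent decrease in survival time relative to the MAGE-negative group."""
    return 100.0 * (negative_months - expressing_months) / negative_months

# Overall survival in months (expressing vs. negative tumors), as quoted in the text.
mage_a3_reduction = percent_reduction(30, 69)
mage_a6_reduction = percent_reduction(33, 71)

print(f"MAGE-A3: {mage_a3_reduction:.1f}% reduction in overall survival time")
print(f"MAGE-A6: {mage_a6_reduction:.1f}% reduction in overall survival time")
```

Both values come out above 50%, consistent with the statement in the text.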

MAGE-A3/6 Are Required for Cancer Cell Viability and 
Function as Oncogenes 

MAGE-A3/6 could be “passenger” genes that have little func- 
tional role or significance in tumorigenesis and are simply bio- 
markers. Alternatively, MAGE-A3/6 may be oncogenic “driver” 
genes that are involved in promoting tumor initiation and/or pro- 
gression. To determine if MAGE-A3/6 have important functional 
roles in cancer cells, we examined whether patient-derived 
breast, colon, and lung cancer cells require the expression of 
MAGE-A3/6 for viability. Indeed, knockdown of MAGE-A3/6 us- 
ing two independent small interfering RNAs (siRNAs) (Figures 
S2A and S2B) in multiple lung (HCC193, H1648, and H2126), 
breast (HCC1806 and SK-BR-3), and colon (HCT116 and 
HT29) cancer cells resulted in a significant decrease in cell 
viability and clonogenic survival (Figures 2A-2C, S2C, and 
S2D). Importantly, these effects are likely on target, because 
MAGE-A3/6 siRNAs do not significantly alter the viability of 
MAGE-A3/6-negative cells (HCC1143 and DLD1), even though 



Figure 1. MAGE-A3 and MAGE-A6 Are Normally Restricted to Expression in the Testis but Are Aberrantly Expressed in Cancer and Predict 
Poor Patient Prognosis 

(A and B) qRT-PCR analysis (n = 3) of the normalized expression of mouse MAGE-A3 (A) and MAGE-A6 (B) in the indicated tissues from BALB/C mice. 

(C) qRT-PCR analysis (n = 3) of the normalized expression of human MAGE-A3/6 (one primer set detects both) in the indicated human tissues. 

(D and E) Percentage of patient tumors expressing MAGE-A3 (D) and MAGE-A6 (E) is shown. 

(F-H) MAGE-A3 and MAGE-A6 are co-expressed in breast invasive carcinomas (F), colorectal adenocarcinomas (G), and lung squamous cell carcinomas (H). 
(I and J) Expression of MAGE-A3 (I) or MAGE-A6 (J) in patients with lung squamous carcinomas correlates with poor overall survival. 

Data are represented as the mean ± SD. Asterisks indicate p < 0.05. See also Figure S1 and Table S1. 
Figure 2. MAGE-A3/6 Are Potent Oncogenes Whose Expression Is Necessary for 
Viability of Cancer Cells and Is Sufficient to Transform Cells 

(A-C) MAGE-A3/6 depletion reduces viability of 
MAGE-A3/6-expressing tumor cell lines. Lung (A), 
breast (B), and colon (C) cancer cells were treated 
with siControl, two distinct siMAGE-A3/6, or 
cytotoxic siUbiquitinB for a transfection control. 
Cell viability was measured by MTT assay. 

(D) MAGE-A3 and MAGE-A6 have oncogenic ac- 
tivity. NIH 3T3 fibroblasts were transfected with 
MAGE-A3, MAGE-A6, or mutant K-RasV12 as a 
positive control, and foci formation was assayed. 
Foci were stained with crystal violet and counted. 

(E) MAGE-A6 promotes anchorage-independent 
growth of DLD1 colon cancer cells. MAGE-A- 
negative DLD1 cells stably expressing vector or 
MAGE-A6 were assayed for anchorage-indepen- 
dent growth in soft agar colony formation assays. 
(F and G) MAGE-A6 promotes anchorage-inde- 
pendent growth of non-transformed, immortalized 
human colonic epithelial cells (HCECs) without (F) 
or with (G) expression of mutant K-RasV12. The 
indicated HCEC cells were assayed for anchorage- 
independent growth in soft agar colony formation 
assays. 

Data (n = 3) are represented as the mean ± SD. 
Asterisks indicate p < 0.05. See also Figure S2. 



the cytotoxic siRNA targeting ubiquitin B was just as lethal in all 
cell lines (Figures 2A-2C). These results suggest that upon 
expression of MAGE-A3/6, cells become dependent on their 
expression for viability, similar to other reports of “oncogene 
addiction” (Weinstein, 2002). 

To determine whether MAGE-A3/6 are indeed oncogenic 
driver genes, we analyzed their activity in several classical as- 
says. First, expression of either MAGE-A3 or MAGE-A6 signifi- 
cantly stimulated foci formation of NIH 3T3 cells (Figure 2D). 
Furthermore, MAGE-A6 promoted other hallmarks of cancer, 
such as anchorage-independent growth of the MAGE-A3/6- 
negative DLD1 colon cancer cells (Figure 2E). Finally, to more 
stringently assay the oncogenic activity of MAGE-A3/6, we 
determined the ability of MAGE-A6 to promote tumorigenic phe- 
notypes in non-transformed, human colonic epithelial cells 
(HCECs) derived from normal colon biopsies and immortalized 
with CDK4 and hTERT (Roig et al., 2010). Remarkably, stable 
expression of MAGE-A6 alone was sufficient to drive transforma- 
tion of these cells, resulting in robust anchorage-independent 
growth in soft agar (Figure 2F). In this setting, the oncogenic activity 
of MAGE-A6 was even more robust than expression of the 
bona fide K-RasV12 oncogene (Figure 2F). Additionally, even in 
the context of mutant K-RasV12 expression, MAGE-A6 was still 
competent to drive anchorage-independent growth of HCEC cells 
to a similar degree as the ApcMin oncogene (Figure 2G). Consistent 
with these findings, MAGE-A3/6 drives 
increased tumor growth and metastasis 
in an orthotopic xenograft mouse model 
of thyroid cancer (Liu et al., 2008). Collectively, these findings 
suggest that MAGE-A3/6 are potent driver oncogenes that 
have therapeutic potential. 

The MAGE-A3/6-TRIM28 E3 Ubiquitin Ligase 
Ubiquitinates and Degrades AMPKα1 

Previously, we reported that MAGE proteins, including MAGE- 
A3/6, form complexes with specific E3 ubiquitin ligases to regu- 
late ubiquitination (Doyle et al., 2010; Hao et al., 2013). MAGE- 
A3/6 specifically bind to the TRIM28 E3 ubiquitin ligase, also 
known as KAP1 (Doyle et al., 2010). We and others have previ- 
ously shown that MAGE-A proteins can inhibit the critical p53 
tumor suppressor, including via MAGE-A-TRIM28-induced 
ubiquitination and proteasome-mediated degradation of p53 
(Doyle et al., 2010; Marcar et al., 2010; Monte et al., 2006; 
Wang et al., 2005; Yang et al., 2007). However, several of the 
cell lines dependent on MAGE-A3/6 for viability (Figures 2A-2C) 
are p53 null (HCC1806 and H1648) or mutant (HCC193, 
SK-BR-3, and HT-29). Additionally, MAGE-A3/6 stimulated 
anchorage-independent growth in p53 mutant DLD1 colon can- 
cer cells (Figure 2E). Furthermore, expression of MAGE-A3/6 did 
not inversely correlate with p53 mutational status (χ² = 0.17; 
p = 0.98). Therefore, the MAGE-A3/6-TRIM28 E3 ubiquitin ligase 
may have additional targets relevant to its function in cancer 
cells. 
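
For readers unfamiliar with the kind of test behind the χ² value quoted above, the sketch below runs a Pearson chi-square test of independence (no continuity correction) on a 2×2 table of MAGE-A3/6 status versus p53 status. The counts are hypothetical placeholders, not data from this study, and the 2×2 layout with one degree of freedom is an assumption; the contingency table and degrees of freedom actually used are not given in this section.

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square test of independence for a 2x2 table of counts.

    Returns (statistic, p_value) for 1 degree of freedom, with no
    continuity correction.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    stat = 0.0
    for i, observed_row in enumerate(table):
        for j, observed in enumerate(observed_row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    # For 1 degree of freedom, P(X > stat) = erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value

# Hypothetical counts: rows = MAGE-A3/6 positive / negative tumors,
# columns = p53 mutant / wild-type tumors.
hypothetical_counts = [[20, 30], [22, 28]]
stat, p_value = chi_square_2x2(hypothetical_counts)
print(f"chi-square = {stat:.3f}, p = {p_value:.3f}")
```

A small statistic with a large p value, as in this toy table, indicates no detectable association between the two classifications.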

To identify additional targets of the MAGE-TRIM28 E3 ubiqui- 
tin ligase, we screened for its direct substrates by performing 
in vitro ubiquitination reactions on protein microarrays contain- 
ing >9,000 SF9-purified, recombinant proteins (Figure 3A). This 
screen yielded 19 potential MAGE-TRIM28 substrates (Table 
S2) whose ubiquitination was significantly (p < 0.05) increased 
by MAGE-TRIM28. To validate the results of the screen, candi- 
dates were tested for regulation by MAGE-A3/6-TRIM28, with 
AMPKα1 being the most consistent and robust target of 
MAGE-A3/6-TRIM28 (Figures 3B-3D). Multiple siRNAs targeting 
MAGE-A3/6 or TRIM28 decreased ubiquitination of AMPKα1 
(Figures 3B and S2F). Expression of MAGE-A3 in MAGE-A3/6-negative 
cells promoted ubiquitination of AMPKα1 (Figure 3C). 
Furthermore, knockdown of MAGE-A3/6 or TRIM28 resulted in 
increased AMPKα1 protein levels (Figure 3D), without affecting 
mRNA levels (Figure 3E). Additional subunits of the AMPK holoenzyme 
complex, such as AMPKβ1 and AMPKγ1, were correspondingly 
elevated (Figure S2G). In contrast, expression of 
MAGE-A3 in MAGE-A3/6-negative cells decreased AMPKα1 
protein levels (Figure 3F), without affecting mRNA levels (Figure 
S2E). Notably, AMPKα1 protein levels could be rescued by 
addition of the MG132 proteasome inhibitor (Figure 3F), suggesting 
that MAGE-A3/6-TRIM28 ubiquitination of AMPKα1 leads to 
its degradation by the proteasome. 

To determine if MAGE-A3/6 enhances degradation of AMPKα1 
by TRIM28, as is the case for p53 (Doyle et al., 2010), or if MAGE-A3/6 
is required for specifying AMPKα1 degradation by TRIM28, 
we examined whether TRIM28 regulates AMPKα1 levels in 
MAGE-A3/6-negative cells. Knockdown of TRIM28 in MAGE-A3/6-negative 
cells had no effect on AMPKα1 levels (Figure 3G), 
suggesting that MAGE-A3/6 is required for AMPKα1 degradation 
by TRIM28. Consistent with this notion, the AMPKα1β1γ1 holoenzyme 
complex (Figure S2H) and specifically AMPKα1 bound 
to recombinant GST-MAGE-A3 and GST-MAGE-A6, but not 
GST-TRIM28 in vitro (Figure 3H). Additionally, overexpressed 
and endogenous AMPKα1 co-immunoprecipitated with MAGE-A3 
from cells (Figures 3I and S2I). These results suggest that 
MAGE-A3/6 specifies ubiquitination of AMPKα1 by the TRIM28 
ubiquitin ligase. 

Inhibition of AMPK by MAGE-A3/6-TRIM28 Impacts 
Cellular Metabolic Processes 

Next, we determined whether modulation of AMPK protein levels 
by MAGE-A3/6-TRIM28 had a functional impact on AMPK activ- 
ity and the cellular metabolic processes it controls. Knockdown 
of MAGE-A3/6 or TRIM28 increased both total and active (phosphorylated, 
pT172) AMPKα1 (Figure 4A). Furthermore, the 
knockdown of MAGE-A3/6 or TRIM28 resulted in increased 
phosphorylation of ACC1 (Figure 4B), a prototypical target of 
AMPK (Carling et al., 1987). Although AMPK generally antago- 
nizes the Warburg effect (Faubert et al., 2013), acute stimulation 
of AMPK can promote glycolysis through a variety of activities, 
including plasma membrane localization of the GLUT1 glucose 
transporter and increased glucose consumption (Barnes et al., 
2002; Hardie et al., 2012b). Knockdown of MAGE-A3/6 or 



TRIM28 resulted in increased plasma membrane localized 
GLUT1 (Figure 4C). Furthermore, knockdown of TRIM28 in- 
creased glucose consumption (Figure 4D) and correspondingly 
increased lactate production (Figure 4E) in MAGE-A3/6-positive 
cells. These results suggest that MAGE-A3/6-TRIM28 has a 
functional impact on cellular metabolism. 

In addition to regulating cellular glucose metabolism, AMPK is 
well documented to inhibit anabolic processes, such as mTOR- 
dependent protein synthesis, to conserve energy in the context 
of reduced ATP levels (Gwinn et al., 2008; Inoki et al., 2003). 
Therefore, we knocked down MAGE-A3/6 or TRIM28 and fol- 
lowed mTOR activity by examining phosphorylation of p70 ribo- 
somal S6 kinase and ribosomal S6 protein. Upon knockdown of 
MAGE-A3/6 or TRIM28, mTOR signaling was severely inhibited 
and phosphorylation of both p70 ribosomal S6 kinase and ribosomal 
S6 protein was reduced (Figure 4F). Similarly, amino-acid-induced 
mTOR activity was significantly reduced upon 
depletion of MAGE-A3/6 (Figure 4G). Importantly, reduction in 
basal mTOR activity was rescued by treatment with the AMPK 
inhibitor, compound C (Figure 4H), or co-depletion of AMPKα1 
(Figure 4I). Together, these results suggest that the MAGE-A3/6-TRIM28 
ubiquitin ligase is functionally important for maintenance 
of mTOR activity, likely through inhibition of AMPK. 

MAGE-A3/6-TRIM28 Ubiquitin Ligase Inhibits 
Autophagy 

Because MAGE-A3/6-TRIM28 regulates both AMPK and mTOR 
activities and both of these signaling pathways converge to op- 
posingly modulate autophagy (Egan et al., 2011; Kim et al., 
2011), we examined whether MAGE-A3/6-TRIM28 influences 
autophagy. One mechanism by which AMPK and mTOR regulate 
autophagy is through phosphorylation of the proximal ULK1 ki- 
nase required for autophagosome formation. AMPK phosphory- 
lation of ULK1 S555 promotes ULK1 activity and autophagy, 
whereas mTOR phosphorylation of ULK1 S757 inhibits ULK1 
activity and autophagy (Egan et al., 2011; Kim et al., 2011). 
Knockdown of MAGE-A3/6 or TRIM28 upregulated ULK1 S555 
phosphorylation (AMPK site) and downregulated ULK1 S757 
phosphorylation (mTOR site) (Figure 5A). Changes in ULK1 phos- 
phorylation by MAGE-A3/6 or TRIM28 knockdown were accom- 
panied by the expected increase in GFP-LC3 puncta, a marker of 
autophagy (Figures 5B and S3A-S3C). The increased GFP-LC3 
puncta in MAGE-A3/6- and TRIM28-depleted cells was blocked 
by co-depletion of ULK1 (Figure 5C). The magnitude of increased 
GFP-LC3 puncta upon MAGE-A3/6 or TRIM28 knockdown was 
similar to knockdown of mTOR, an established potent inhibitor 
of autophagy (Figure 5B). To complement our results using cells 
stably expressing GFP-LC3, we examined the number of endog- 
enous LC3 puncta upon knockdown of MAGE-A3/6. Similarly to 
GFP-LC3, siRNAs targeting MAGE-A3/6 induced the accumula- 
tion of endogenous LC3 puncta in MAGE-A3/6-positive cells but 
had no effect in MAGE-A3/6-negative cells (Figures 5D and 5E). 
Importantly, short-term inhibition of AMPK with compound C 
attenuated MAGE-A3/6-RNAi-induced GFP-LC3 puncta forma- 
tion (Figure 5F). 

Since an increase in LC3 puncta may represent either a block 
in autophagosome fusion with lysosomes or an increase in auto- 
phagy, we measured the consumption (levels) of GFP-LC3 by 
Figure 3. MAGE-A3/6-TRIM28 E3 Ubiquitin Ligase Ubiquitinates and Degrades AMPKα1 

(A) Schematic of in vitro screen for MAGE-TRIM28 ubiquitination substrates using protein arrays. 

(B) AMPKα1 ubiquitination requires MAGE-A3/6-TRIM28. HeLa cells (MAGE-A3/6-positive) were treated with the indicated siRNAs for 24 hr before transfection with 
Myc-tagged ubiquitin for 48 hr before anti-Myc immunoprecipitation (IP) and immunoblotting was performed (n = 3). 

(C) Expression of MAGE-A3 promotes AMPKα1 ubiquitination. MAGE-A3/6-negative HEK293 cells stably expressing FLAG-MAGE-A3 were transfected with 
Myc-ubiquitin 48 hr before anti-Myc IP and immunoblotting was performed (n = 3). 

(D) Knockdown of MAGE-A3/6-TRIM28 increases AMPKα1 protein levels. MAGE-A3/6-positive cells were treated with the indicated siRNAs for 72 hr and then 
blotted for the indicated proteins (n = 3). 

(E) Knockdown of MAGE-A3/6-TRIM28 does not affect AMPKα1 mRNA levels. MAGE-A3/6-positive cells were treated with the indicated siRNAs for 72 hr and 
then AMPKα1 mRNA levels were determined by qRT-PCR (n = 3). Data are represented as the mean ± SD. Asterisks indicate p < 0.05. 
flow cytometry. We observed a significant decrease in GFP-LC3 
fluorescence upon knockdown of MAGE-A3/6 or TRIM28, and 
this was again similar to the degree of GFP-LC3 consumption 
upon mTOR depletion (Figures 5G and S3D). These results 
were further confirmed by western blotting where knockdown 
of MAGE-A3/6 or TRIM28 promoted a marked decrease in 
GFP-LC3 protein levels (Figure 5H). These changes in GFP- 
LC3 were not due to alterations in GFP-LC3 mRNA levels (Fig- 
ure S3E). To determine if the decrease in LC3 was the conse- 
quence of increased autophagic flux, we treated cells depleted 
of MAGE-A3/6 or TRIM28 with bafilomycin A1 to prevent acidification 
of lysosomes and degradation of proteins by autophagy. 
Short-term treatment of cells with bafilomycin A1 blocked 
consumption of GFP-LC3 by knockdown of MAGE-A3/6 or 
TRIM28 (Figure 5I). Finally, we examined the levels of autophagy 
in MAGE-A3/6- or TRIM28-depleted cells by an independent 
measure, consumption of the p62/SQSTM1 autophagy adaptor. 
Similarly to LC3, endogenous p62/SQSTM1 was consumed 
upon knockdown of MAGE-A3/6 or TRIM28 (Figure 5J), and 
this could be rescued by bafilomycin A1 (Figure 5K). Further- 
more, the ability of MAGE-A3/6 to inhibit autophagy was also 
confirmed by expression of MAGE-A3 in normal, non-transformed 
cells that typically are negative for MAGE-A3/6. MAGE-A3 
expression induced the degradation of AMPKα1 and the 
accumulation of p62/SQSTM1 (Figure 5L), consistent with 
reduced autophagy in these cells. Collectively, these results 
suggest that MAGE-A3/6-TRIM28 inhibits autophagy and that 
depletion of MAGE-A3/6 or TRIM28 dramatically increases 
autophagic flux. 

MAGE-A3/6 Regulation of AMPKα1 Is Relevant 
in Human Tumors 

Our results suggest that the oncogenic MAGE-A3/6-TRIM28 
ubiquitin ligase regulates several cellular metabolic regulatory 
pathways through ubiquitination and degradation of AMPKα1. 
To determine the relevance of these findings to human tumors, 
we examined whether MAGE-A3/6 expression inversely corre- 
lated with AMPK activity and protein levels in patient tumor 
samples. Indeed, breast invasive carcinoma (Figure S4A), 
colon adenocarcinoma (Figure 6A), and lung squamous cell 
carcinoma (Figure 6B) tumors expressing MAGE-A3/6 had 
significantly reduced total and active (phospho-T172) AMPKα 
protein levels. This reduction was not a consequence of decreased 
AMPKα1 mRNAs in these tumors (Figures 6A, 6B, 
and S4A). Consistent with these findings, the phosphorylated 
form of AMPK is downregulated in a high proportion of cases 
of breast cancer (Hadad et al., 2009). In addition, MAGE-A3/ 
6 expression in tumors correlated with reduced downstream 
AMPK signaling, such as increased markers of mTOR activity 
(Figure S4B). 



Finally, AMPK agonists are of significant interest in treatment 
and prevention of cancer (Hardie et al., 2012a). Thus, we determined 
whether AMPK agonists could reverse the phenotypes of 
MAGE-A3/6 driven anchorage-independent growth and cancer 
cell viability. The AMPK activating compounds, aminoimidazole 
carboxamide ribonucleotide (AICAR) and metformin, sup- 
pressed the ability of MAGE-A6 to promote anchorage-inde- 
pendent growth of normal HCEC cells and DLD1 colon cancer 
cells in a dose-dependent manner (Figures S4D-S4F). Impor- 
tantly, these effects were specific to MAGE-A6-expressing 
cells as AICAR and metformin minimally affected ApcMin- or 
MAGE-B10-driven anchorage-independent growth of HCEC 
cells (Figures S4D and S4E). Since the cellular effects of both 
AICAR and metformin extend beyond just activation of AMPK, 
including affecting mitochondrial respiration (Hardie et al., 
2012a), we also examined whether a direct allosteric activator 
of AMPK, A769662 (Cool et al., 2006; Landgraf et al., 2013), 
or genetic manipulation of AMPKα1 could alter phenotypes 
associated with MAGE-A3/6. MAGE-A6-induced, but not ApcMin- 
or MAGE-B10-induced, anchorage-independent growth of 
HCEC and DLD1 cells was significantly impaired by A769662 
in a dose-dependent manner (Figures 6C and S4G). Furthermore, 
co-depletion of AMPKα1 rescued the MAGE-A3/6-RNAi-induced 
decrease in cell viability (Figure 6D). Taken together, 
these results suggest that regulation of AMPK by MAGE-A3/6 
is relevant to human tumors and pharmacological agonists of 
AMPK may have therapeutic potential in MAGE-A3/6-positive 
tumors. 

DISCUSSION 

AMPK senses and responds to the energy status of cells to regu- 
late multiple metabolic processes and limit energy expenditure. 
Significant effort has been directed toward understanding the 
role and dysregulation of AMPK in cancer. One known mecha- 
nism of reducing AMPK activity in cancer is mutation/deletion 
of its upstream regulatory kinase Lkb1/Stk11. However, this is 
a rare event in most tumor types other than lung adenocarci- 
nomas and cervical cancers (Wingo et al., 2009). In this study, 
we demonstrate that the MAGE-A3/6-TRIM28 E3 ligase complex 
ubiquitinates and degrades AMPKα1. Thus, the prominent activation 
of MAGE-A3/6 expression in many cancer types may 
represent an alternative mechanism for downregulating the 
AMPK signaling pathway (Figure S5). Consistent with this, 
expression of MAGE-A3/6 and mutation of Lkb1/Stk11 are rarely 
found in the same lung adenocarcinoma tumors (Figure S4C, 
p < 0.01). 

MAGE-A3/6 are normally exclusively expressed in the testis 
but are frequently turned on in many tumor types, including co- 
lon, lung, and breast tumors (Figure 1). In combination with 



(F) MAGE-A3 promotes proteasome-dependent AMPKα1 degradation. MAGE-A3/6-negative cells expressing vector or MAGE-A3 were treated with 5 μM MG132 
for 4 hr before immunoblotting (n = 3). 

(G) TRIM28-mediated AMPKα1 degradation requires MAGE-A3/6. MAGE-A3/6-negative HEK293 cells were transfected with the indicated siRNA for 72 hr before 
cell lysates were immunoblotted (n > 3). 

(H) GST pull-down assays reveal AMPKα1 directly binds to MAGE-A3 and MAGE-A6, but not TRIM28 (22-418) or GST (n = 3). 

(I) HeLa cells expressing FLAG-MAGE-A3 or FLAG-vector along with Myc-AMPKα1 were subjected to anti-Myc IP and immunoblotting. 

WCL, whole-cell lysate. Data are representative of multiple experiments (n = 2). See also Table S2. 
previous studies, our findings suggest that activation of MAGE-A3/6 
in cancer cells is not a by-product, passenger event during 
cellular transformation and tumorigenesis, but rather MAGE-A3/ 
6 are driver genes that support multiple phenotypes associated 
with tumors, including metabolic dysregulation. We propose that 
one critical oncogenic function of MAGE-A3/6 is downregulation 
of AMPK and alteration of cellular metabolism in cancer cells. 
Strikingly, this mode of AMPK regulation does not occur in 
normal somatic cells that do not express MAGE-A3/6 but only 
occurs upon reactivation of the testicular MAGE-A3/6 program 
in cancer cells. 

Although AMPK coordinates many different actions in the 
cell, one key process it controls is autophagy. While the role 
of autophagy in the progression of cancer is multifaceted, 
loss of autophagy has been implicated in the initiation of tumorigenesis 
(Choi et al., 2013; Wei et al., 2013; White, 2012). Our results 
suggest that aberrant activation of MAGE-A3/6 in tumors 
may provide a unique mechanism for inhibition of tumor- 
suppressive autophagy during tumor initiation. Interestingly, 
MAGE-A3/6 expression is undetectable in never-smokers but 
is aberrantly found in the lungs of smokers before they have 
any clinical signs of disease (Jang et al., 2001). Thus, MAGE- 
A3/6 expression may occur early during tumor initiation and 
could be one mechanism to downregulate autophagy during 
this stage. Identification of the factors that regulate MAGE- 
A3/6 expression in adult tissues may provide insights into 
understanding events leading to tumor initiation. One major reg- 
ulatory mechanism controlling expression of MAGE cancer- 
testis antigens is promoter CpG methylation in normal somatic 
cells (Simpson et al., 2005). However, simple demethylation of 
MAGEs is not sufficient to drive expression (Weber et al., 
1994). The identification of additional transcriptional regulators 
will be of utmost importance. 

Our findings of the association of MAGE-A3/6 expression with 
AMPK degradation in human tumors have important and potentially 
immediate implications for the utilization of AMPK activating 
compounds, such as metformin and A769662, that are vigor- 
ously being pursued in the prevention and treatment of cancer 
(Quinn et al., 2013). While AMPK activating drugs are currently 
in clinical trials for treatment of a variety of tumor types, the early 
results thus far have been mixed with no apparent explanation 
(Quinn et al., 2013). We propose that MAGE-A3/6 expression 
status may be a useful enrollment biomarker to select patients 
with the greatest potential response to AMPK agonists. Addition- 
ally, since MAGE-A3/6 expression increases signaling through 



the mTOR pathway, the use of currently approved mTOR inhib- 
itors may be effective in the future treatment of MAGE-A3/6- 
driven tumors. 

Little is known about the physiological role of MAGE-A3/6 in 
the testis. Our findings on the molecular and cellular functions 
of MAGE-A3/6-TRIM28 in cancer provide intriguing insights 
into their normal physiological function during spermato- 
genesis. Interestingly, germ cells in the testis switch their car- 
bon energy sources as they differentiate from spermatogonia 
stem cells to maturing haploid spermatids (Nakamura et al., 
1984). We propose that MAGE-A3/6 may function to protect 
maturing spermatocytes from energy stress by dampening 
AMPK activation. Also, MAGE-A3/6 might enable develop- 
mental stage-dependent activation of anabolic pathways 
required for normal spermatogenesis, such as lipid and protein 
synthesis. Furthermore, developing spermatocytes also ex- 
press an unusual splice variant of LKB1 with a different C-ter- 
minal region, which is required for spermiogenesis (Towler 
et al., 2008). Consistent with these ideas, we have found 
that mouse MAGE-A genes are highly expressed in pre-pachytene 
spermatocytes (data not shown) where these regulatory 
events are occurring and previous studies have shown that 
testis-specific knockout of TRIM28 blocks spermatocyte dif- 
ferentiation, resulting in testis degeneration (Weber et al., 
2002). 

In summary, our findings illuminate a previously unrecognized, 
widespread regulation of AMPK during tumorigenesis by a 
testis-specific ubiquitin ligase, provide an unprecedented mo- 
lecular mechanism by which MAGE cancer-testis antigens drive 
tumorigenesis, and have important implications for maximizing 
the clinical utility of AMPK-directed chemotherapies. 

EXPERIMENTAL PROCEDURES 

Cell Culture and Transfections 

Cells were cultured under standard conditions and transfected according to 
the manufacturer’s recommendations. Detailed descriptions of cell-culture conditions, 
transfection procedures, siRNA sequences, and antibodies are 
provided in Extended Experimental Procedures. 

RNA Preparation and qRT-PCR 

Preparation of RNA from tissues and cells and qRT-PCR analysis were performed 
by standard molecular biology techniques and are described in Extended 
Experimental Procedures. All procedures and use of mice were approved by 
the Institutional Animal Care and Use Committee of UT Southwestern Medical 
Center. 



Figure 4. MAGE-A3/6-TRIM28 Ubiquitin Ligase Regulates AMPK-Controlled Metabolic Processes 

(A and B) MAGE-A3/6-TRIM28 knockdown increases phospho-AMPK (A) and phospho-ACC1 (B) signaling. HeLa or U2OS cells were treated with the indicated 
siRNAs for 72 hr before cell lysates were immunoblotted (n = 3). 

(C) MAGE-A3/6-TRIM28 knockdown increases Glut1 plasma membrane localization. HeLa cells were treated with the indicated siRNAs for 72 hr before 
immunostaining for Glut1 (n = 3). Scale bar, 20 μm. 

(D and E) MAGE-A3/6-TRIM28 knockdown alters glucose metabolism. HeLa cells were treated with MAGE-A3/6 or TRIM28 siRNA for 72 hr and then fed with fresh 
media. After 6 hr, media was collected and glucose (D) and lactate (E) levels in media were analyzed via Nova analyzer. Data (n = 3) represent mean ± SD. 

(F) MAGE-A3/6-TRIM28 is required for mTOR signaling. HeLa or U2OS cells were treated with the indicated siRNAs for 72 hr before immunoblotting (n = 3). 

(G) MAGE-A3/6-TRIM28 is required for amino-acid-induced mTOR activity. HeLa cells were treated with siControl, siMAGE-A3/6, or siRagC (positive control) 
siRNAs for 72 hr before 6 hr starvation in EBSS (0 min) or starvation followed by 30 min amino acid stimulation (n = 3). 

(H and I) Inhibition of AMPK reverses mTOR inhibition by MAGE-A3/6 or TRIM28 knockdown. Cells were transfected with the indicated siRNAs for 72 hr before 
collection and immunoblotting (I) or treatment for 4 hr with 10 μM compound C or vehicle (DMSO) (H) (n > 2). 

Asterisks indicate p < 0.05. 
Figure 5. MAGE-A3/6-TRIM28 Ubiquitin Ligase Inhibits Autophagy 

(A) HeLa or U2OS cells were transfected with siRNA for 72 hr before immunoblotting (ULK1 pS555 AMPK target site; ULK1 pS757 mTOR target site) 
(n > 3). 

Figure 6. Regulation of AMPK by MAGE-A3/6-TRIM28 Is Relevant to Human Tumors 

(A and B) TCGA data were analyzed for MAGE-A3/6 mRNA levels and total and active (pT172) AMPKα protein levels. Data are mean ± SE with number (n) of 
tumors indicated. Asterisks indicate p < 0.01. 

(C) Anchorage-independent growth assays of the indicated HCEC cells were performed in presence of the indicated concentrations of A769662. Number of 
colonies >100 μm were counted (n = 3). Data are mean ± SD. Asterisks indicate p < 0.05. 

(D) Colony formation assays were performed in HeLa cells treated with the indicated siRNAs. Data are mean ± SD, n = 3. Asterisks indicate p < 0.05. 

See also Figures S4 and S5. 



Colony Formation and Anchorage-Independent Growth Soft Agar 
Assays 

For colony formation assays on plastic, cells were transfected for 72 hr 
with siRNAs and then re-plated at single-cell density. After 3 weeks, 
cells were fixed and stained with crystal violet (0.05% [w/v]) and counted. 
For anchorage-independent soft agar growth assays, cells 
were suspended in 0.375% Noble agar (Difco) supplemented with 
regular growth medium and overlaid on 0.5% Noble agar. Cells were 
allowed to grow for 2-4 weeks before colonies >100 μm in size were 
counted. 

Immunofluorescence and Microscopy 

Immunofluorescence was performed essentially as described previously (Hao 
et al., 2013) and in Extended Experimental Procedures. 



(B) U2OS cells stably expressing GFP-LC3 were treated with the indicated siRNAs for 72 hr before imaging and quantitation of GFP-LC3 puncta (n = 3). Boxplots 
are mean and quartiles. 

(C) U2OS cells stably expressing GFP-LC3 were treated with the indicated siRNAs for 72 hr before imaging (n = 3). Boxplots represent mean and quartiles of n > 50 
cells. 

(D and E) Seventy-two hours after transfection, cells were stained for endogenous LC3 (D). Scale bars, 20 μm. Boxplots of number of LC3 puncta per cell (E). 
Boxplots are mean and quartiles of n > 50 cells. 

(F) U2OS cells were transfected with the indicated siRNA for 72 hr before treatment for 4 hr with 10 μM compound C or vehicle (DMSO). Boxplots represent mean 
and quartiles of n > 50 cells. 

(G) U2OS GFP-LC3 cells were transfected with the indicated siRNA for 72 hr. Median GFP fluorescent intensity ±SD as determined by flow cytometry is shown 
(n = 3). 

(H) Cells stably expressing GFP-LC3 were treated with siRNAs for 72 hr before immunoblotting (n > 3). 

(I) Knockdown of MAGE-A3/6 or TRIM28 increases autophagic flux. GFP-LC3 cells were transfected with the indicated siRNA for 72 hr. Cells were treated with 
bafilomycin A for 4 hr before cell lysates were immunoblotted (n = 3). 

(J and K) Knockdown of MAGE-A3/6 or TRIM28 increases p62 consumption. HeLa or U2OS cells were treated with the indicated siRNA for 72 hr. Cell lysates 
were collected (J) or cells were treated with DMSO or bafilomycin A for 4 hr before cell lysates were collected (K), and the indicated proteins were detected by 
immunoblotting (n > 2). 

(L) MAGE-A3/6-negative HBEC cells were stably transfected with Myc-MAGE-A3, and cell lysates were immunoblotted (n = 2). 

Asterisks indicate p < 0.05. See also Figure S3. 

In Vitro Ubiquitination Screen 

ProtoArray containing >9,000 GST-tagged recombinant proteins purified from 
Sf9 insect cells was purchased from Invitrogen. In vitro ubiquitination on the 
slide was performed according to the manufacturer's instructions with minor 
modifications described in Extended Experimental Procedures. 

Glucose Consumption and Lactate Measurements 

Twenty-four hours after plating, cells were transfected with siRNA. 
Seventy-two hours after siRNA transfection, cells were switched to fresh 
medium for 6 hr. The medium was then collected and analyzed using a Nova 
analyzer to quantitate the amounts of glucose and lactate. 

Recombinant Protein Purification and In Vitro Binding Assays 

Recombinant proteins were produced using standard procedures described in 
Extended Experimental Procedures. In vitro binding assays were performed as 
described previously (Doyle et al., 2010; Hao et al., 2013) and specified in the 
Experimental Procedures. 

Assessment of mRNA/Protein Expression Levels in Human Tumors 
and Statistical Analysis 

mRNA levels, survival data, and mutational status were determined using 
The Cancer Genome Atlas. Tumor protein expression levels were determined 
previously by reverse-phase protein arrays performed on tumors with matching 
RNA sequencing data (Cancer Genome Atlas Research Network, 2014). 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, five 
figures, and two tables and can be found with this article online at 
http://dx.doi.org/10.1016/j.cell.2015.01.034. 

SUMMARY 

Signaling through RAS/MAP kinase pathway is cen- 
tral to biology. ERK has long been perceived as 
the only substrate for MEK. Here, we report that 
HSF1, the master regulator of the proteotoxic stress 
response, is a new MEK substrate. Beyond medi- 
ating cell-environment interactions, the MEK-HSF1 
regulation impacts malignancy. In tumor cells, MEK 
blockade inactivates HSF1 and thereby provokes 
proteomic chaos, presented as protein destabiliza- 
tion, aggregation, and, strikingly, amyloidogenesis. 
Unlike their non-transformed counterparts, tumor 
cells are particularly susceptible to proteomic pertur- 
bation and amyloid induction. Amyloidogenesis is 
tumor suppressive, reducing in vivo melanoma 
growth and contributing to the potent anti-neoplastic 
effects of proteotoxic stressors. Our findings unveil a 
key biological function of the oncogenic RAS-MEK 
signaling in guarding proteostasis and suppressing 
amyloidogenesis. Thus, proteomic instability is an 
intrinsic feature of malignant state, and disrupting 
the fragile tumor proteostasis to promote amyloido- 
genesis may be a feasible therapeutic strategy. 

INTRODUCTION 

Following environmental challenges, cells stimulate production 
of heat-shock proteins (HSPs). This HSP induction is the hall- 
mark of the heat-shock, or proteotoxic stress, response (PSR) 
(Lindquist, 1986). As molecular chaperones, HSPs facilitate 
folding, transportation, and degradation of other proteins (Mori- 
moto, 2008). In guarding the proteome against misfolding and 
aggregation, the PSR preserves proteostasis (Balch et al., 2008). 

In vertebrates, heat shock transcription factors (HSFs) govern 
the PSR. Among them is HSF1, the master regulator of this 
response (Morimoto, 2008; Xiao et al., 1999). As a multi-step 
process, HSF1 activation entails trimerization, nuclear transloca- 
tion, posttranslational modifications, and DNA binding (Mori- 
moto, 2008). Yet, our understanding of this process remains 
incomplete. 

The HSF1-mediated PSR antagonizes many pathological 
conditions, including hyperthermia, heavy-metal toxification, 
ischemia and reperfusion, and oxidative damage, and impacts 
aging and neurodegeneration (Dai et al., 2012a). HSF1, not sur- 
prisingly, acts as a longevity factor (Hsu et al., 2003). In contrast, 
our and others’ work has revealed a pro-oncogenic role of HSF1 
(Dai et al., 2007, 2012b; Jin et al., 2011; Meng et al., 2010; Min 
et al., 2007). Despite its dispensability under non-stress condi- 
tions, HSF1 is crucial for tumor cells’ growth and survival (Dai 
et al., 2007). Nonetheless, the mechanisms underlying its activa- 
tion in malignancy remain unclear. 

Here, we report that RAS-MEK-ERK signaling critically regu- 
lates the PSR. It is MEK that phosphorylates and activates 
HSF1. MEK inhibition destabilizes the proteome, provoking 
protein aggregation and amyloidogenesis. Combinatorial pro- 
teasome blockade potently augments this tumor-suppressive 
amyloidogenic effect. Hence, our findings not only suggest 
HSF1 as a new MEK substrate but also uncover a biological 
function of RAS-MEK-ERK signaling in governing proteostasis. 
Beyond shifting the canonical view of RAS-MEK-ERK signaling, 
our proof-of-concept experiments suggest that intrinsic proteo- 
mic instability associated with malignant state may be exploited 
to combat cancer. 

RESULTS 

MEK and ERK Inversely Regulate the PSR 

Phosphorylation notably impacts HSF1 activation (Guettouche 
et al., 2005), suggesting a key role of signaling pathways. 
To illuminate how such pathways regulate the PSR, we 
first examined their responses to stress, focusing on RAS- 
MEK-ERK signaling. To inflict proteotoxic stress, we applied 
stressors with diverse mechanisms of action, including heat 
shock (HS), proteasome inhibitor MG132, histone deacetylase 
6 inhibitor tubastatin, amino-acid analog azetidine, and HSP 
inhibitors (17-DMAG for HSP90 and VER155008 for HSP70) 
(Kawaguchi et al., 2003; Massey et al., 2010; Morimoto, 2008; 
Neckers and Workman, 2012). Transient exposure to stressors 
did not impair cell viability (Figure S1A) but elevated 
phosphorylation of MEK and ERK (Figure 1A), two key components of 
this pathway. MEK Ser218/222 and ERK Thr202/Tyr204 
phosphorylation signify their active state (Dhillon et al., 2007; Roux 
and Blenis, 2004). Congruently, all stressors activated ELK1 
(Figure 1B), a transcription factor downstream of ERK (Roux 
and Blenis, 2004). 

To determine whether MEK-ERK signaling regulates the 
PSR, we employed U0126 and AZD6244, two specific MEK1/2 

Figure 1. MEK and ERK Oppositely Regulate the PSR 

(A) NIH 3T3 cells were treated with HS at 43°C for 
30 min, 10 μM tubastatin A for 5 hr, 40 μM 
VER155008 for 1 hr, 500 nM MG132 for 1 hr, 
200 nM 17-DMAG for 1 hr, and 2.5 mM azetidine 
for 15 min. 

(B) The dual ELK1 reporter system, comprising a 
serum response element (SRE)-driven secreted 
embryonic alkaline phosphatase (SEAP) plasmid 
and a CMV-driven Gaussia luciferase (GLuc) 
plasmid, was transfected into HEK293T cells. 
After 24 hr, cells were treated as in (A) and 
recovered overnight before measuring reporter 
activities (mean ± SD, n = 6, ANOVA). 

(C and D) NIH 3T3 cells were treated with 20 μM 
U0126 or 20 nM AZD6244 for 3 hr followed by HS 
and 4 hr recovery. mRNA levels were quantitated 
by qRT-PCR (mean ± SD, n = 3, Student’s t test). 

(E) Immediately after HS, nuclear proteins of NIH 
3T3 cells treated as in (C) were extracted to 
measure HSF1-DNA binding by an ELISA-based 
assay (mean ± SD, n = 3, ANOVA). 

(F) HEK293T cells were transfected with dual 
HSF1 reporter plasmids, a heat-shock element 
(HSE)-driven SEAP plasmid and a CMV-GLuc 
plasmid. After 24 hr, cells were treated with 20 μM 
U0126, 20 nM AZD6244, 1 μM FR180204, or 
100 nM Sch772984 for 3 hr followed by HS for 
30 min and overnight recovery (mean ± SD, n = 6, 
ANOVA). 

(G) HEK293T cells were treated with different in- 
hibitors overnight. 

(H-K) A LacZ or MEK isoform plasmid was co- 
transfected with dual HSF1 reporter plasmids into 
HEK293T cells transduced with lentiviral small 
hairpin RNAs (shRNAs). After 24 hr, cells were heat 
shocked at 43°C for 30 min followed by overnight 
recovery (mean ± SD, n = 3, ANOVA). 

(L-O) HEK293T cells were transfected with small 
interfering RNAs (siRNAs) for 48 hr followed by 
transfection with dual HSF1 reporter plasmids for 
24 hr before HS (mean ± SD, n = 6, ANOVA). 

See also Figure S1. 

inhibitors (Favata et al., 1998; Yeh et al., 2007). Both inhibitors 
impeded the HS-induced transcription of Hsp genes and 
impaired the DNA-binding capacity and transcriptional activation 
of HSF1 (Figures 1C-1F and S1B), suggesting that MEK activates 
the HSF1-mediated PSR. ERK, phosphorylated by MEK 
(Ahn et al., 1991), is widely recognized as the master effector 
of this pathway (Dhillon et al., 2007; Roux and Blenis, 2004). 
Surprisingly, the ERK inhibitors FR180204 and Sch772984 (Ohori et al., 
2005; Morris et al., 2013) activated HSF1 (Figures 1F and S1C). 
Both MEK and ERK inhibitors impaired two ERK-mediated 
events, MSK1 phosphorylation and ELK1 activation (Figures 
1G and S1D) (Roux and Blenis, 2004). While MEK inhibitors 
reduced ERK phosphorylation, the two ERK inhibitors showed 
distinct effects (Figure 1G). Sch772984 suppressed ERK 
phosphorylation, likely due to ERK conformational changes that block 
MEK-mediated phosphorylation (Morris et al., 2013); conversely, 
FR180204 promoted ERK phosphorylation (Figure 1G), suggesting 
feedback MEK activation. 

The impacts of MEK and ERK inhibitors on HSF1 were validated 
via genetic depletions of MEK and ERK (Figures 1I-1O). 
While depletion of one ERK isoform diminished the other isoform 
at the protein level (Figures 1L and 1N), mRNA levels of the 
isoform not targeted were elevated (Figures S1E-S1H), suggesting 
posttranscriptional mechanisms underlying reduced proteins. 
These results not only pinpoint RAS-MEK-ERK signaling as a 
key regulator of the PSR, but also reveal divergent impacts of 
MEK and ERK on HSF1. 

MEK Physically Interacts with HSF1 

To determine whether MEK directly activates HSF1, we examined 
endogenous MEK-HSF1 interactions by co-immunoprecipitation 
(coIP). While no evident MEK1/2 proteins were 
precipitated with HSF1 without HS, HS caused a marked 
coIP (Figures 2A and 2B), showing a stress-inducible MEK- 
HSF1 interaction. The mobility shift of HSF1 marks HS-induced 
phosphorylation (Figure 2A). These MEK-HSF1 interactions 
were verified via expression of recombinant proteins (Figures 
S2A and S2B). To determine whether MEK and HSF1 are in 
direct contact, we employed the Proximity Ligation Assay 
(PLA) technique (Clausson et al., 2011). Antibody specificities 
were validated by immunostaining (Figures S2C and S2D). 
In MEK-proficient cells, PLA signals were marginally visible 
without HS and HS intensified these signals (Figure 2C). In 
MEK-deficient cells, only faint signals were detected even after 
HS (Figure 2C), confirming the specificity of PLA. Of note, PLA 
signals were more manifest in the nucleus than in the cyto- 
plasm (Figure 2C), revealing a prominently nuclear localization 
of interactions. These results strongly suggest a direct MEK- 
HSF1 association. 

MEK1 and MEK2 form either homo- or heterodimers in vivo 
(Catalanotti et al., 2009). To address which type of dimer binds 
HSF1, we examined MEK1-HSF1 interactions in the deficiency 
of MEK2. Under HS, more MEK1 proteins were precipitated 
with HSF1 in MEK2-deficient cells (Figure 2D). Similarly, MEK1 
deficiency heightened MEK2-HSF1 interactions (Figure 2E), 
revealing a competition between the two MEK isoforms for 
HSF1 binding and suggesting that MEK homodimers can 
interact with HSF1. 



ERK Suppresses MEK-HSF1 Interactions 
to Inactivate HSF1 

To elucidate how ERK inactivates HSF1, we first examined the 
impact of ERK on MEK-mediated HSF1 activation. Whereas 
ERK1 depletion promoted MEK-HSF1 interactions (Figure 2F), 
ERK1 overexpression mitigated these interactions and sup- 
pressed HSF1 (Figures 2G and S2E). Thus, we contemplated 
three possible scenarios (Figure 2H): (1) both MEK substrates, 
ERK and HSF1, compete for MEK interaction; (2) ERK, like 
MEK, binds HSF1 and thereby competes for HSF1 interaction; 
and (3) ERK inhibits MEK kinase activity toward HSF1. Each of 
the first two scenarios predicts competition between two protein 
complexes; in contrast, the third scenario predicts that ERK 
assembles with MEK and HSF1 into a single protein complex. 
Interestingly, under HS, HSF1 precipitated both MEK and ERK 
(Figure 2I), and ERK precipitated both MEK and HSF1 (Figure 2J). 
Although these results do not exclude the existence of indepen- 
dent MEK-ERK and MEK-HSF1 complexes, they argue against 
the two complexes being stable and prevalent, as depicted in 
the first scenario. To test the second scenario, we detected 
ERK-HSF1 interactions by PLA, as this scenario predicts HSF1 
as a substrate for both ERK and MEK. The specificity of ERK 
antibodies was validated in ERK-depleted cells (Figure S2F). In 
contrast to evident MEK1-ERK interactions (Figure 2K), no 
apparent PLA signals denoting ERK-HSF1 interactions were 
detected (Figure 2L), suggesting lack of direct contact between 
these two proteins. Moreover, while ERK1 overexpression miti- 
gated MEK-HSF1 interactions, less ERK1 proteins were precip- 
itated with HSF1 (Figure 2G), conflicting with heightened ERK1- 
HSF1 interactions predicted by the second scenario. Thus, these 
results not only refute the second scenario but also suggest 
that ERK complexes with HSF1 via MEK, in line with the third 
scenario. Importantly, MEK depletion markedly diminished 
ERK-HSF1 coIP (Figure 2J). These results suggest the existence of 
a protein complex comprising ERK, MEK, and HSF1, wherein 
ERK suppresses HSF1 indirectly, via inhibition of MEK. 

MEK Phosphorylates Ser326 to Activate HSF1 

Under HS, HSF1 undergoes a series of phosphorylating events, 
among which Ser326 phosphorylation stimulates its activation 
(Guettouche et al., 2005). Yet, the identity of the kinase remains 
elusive. To determine whether MEK phosphorylates Ser326, we 
examined the effect of MEK blockade on this modification using 
a phosphospecific antibody that recognized HSF1 WT, but not 
HSF1 S326A, proteins (Figure S3A). Either MEK knockdown or 
U0126 treatment impaired Ser326 phosphorylation (Figures 3A 
and S3B). Conversely, a constitutively active mutant, MEK1 DD 
(S218D/S222D) (Brunet et al., 1994a), induced Ser326 phos- 
phorylation and activation of HSF1 without HS (Figures 3B and 
3C). ERK inhibition enhanced Ser326 phosphorylation, and 
MEK depletion abolished this effect (Figures 3D and S3C), indi- 
cating MEK-dependent regulation. HSF1 S326A mutants dis- 
played impaired transcriptional activities (Figure 3E), congruent 
with their defective nuclear translocation and DNA-binding ca- 
pacity (Figures S3D and 3F). Moreover, HSF1 proteins were 
reduced in MEK-deficient cells (Figures 3A and S3E). To determine 
whether MEK impacts HSF1 stability, we performed 
cycloheximide chase experiments. MEK depletion shortened 

Figure 2. ERK, MEK, and HSF1 Form a Stress-Inducible Protein Complex 

(A and B) After HS at 43°C for 30 min, endogenous HSF1 proteins were precipitated from HEK293T cells. WCL, whole cell lysate; HC, heavy chain. 

(C) Endogenous MEK1-HSF1 interactions were detected by PLA in HeLa cells using a rabbit anti-MEK1 antibody and a mouse anti-HSF1 antibody. Scale bars 
represent 50 μm for LM, 10 μm for HM. 

(D and E) Endogenous MEK-HSF1 interactions were detected by IP in HEK293T cells stably expressing shRNAs. 


the half-life of HSF1 WT protein; importantly, while HSF1 S326A mutants 
were less stable in MEK-proficient cells, their stability was 
not evidently affected by MEK deficiency (Figure 3G), suggesting 
that MEK stabilizes HSF1 largely via Ser326 phosphorylation. 
These results indicate that MEK controls in vivo HSF1 Ser326 
phosphorylation, a modification critical for its activation and 
stability. 
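
The half-life comparison from the cycloheximide chase can be illustrated with a short Python sketch. This is only an illustration under stated assumptions: the source does not say how half-lives were estimated, so the sketch assumes simple first-order decay and fits it by log-linear least squares. The function name `estimate_half_life`, the normalization of band intensities to an internal control, and the example values are all hypothetical and are not data from the study.

```python
import math


def estimate_half_life(times_hr, levels):
    """Return a half-life in hours from a log-linear least-squares fit.

    Assumes first-order decay, level(t) = level(0) * exp(-k * t), where
    `levels` are protein amounts (for example, band intensities already
    normalized to an internal control) measured at `times_hr`.
    """
    if len(times_hr) != len(levels) or len(times_hr) < 2:
        raise ValueError("need at least two matched time points")
    if any(v <= 0 for v in levels):
        raise ValueError("levels must be positive")
    logs = [math.log(v) for v in levels]
    n = len(times_hr)
    mean_t = sum(times_hr) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_hr)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_hr, logs))
    k = -sxy / sxx  # decay rate constant (per hour)
    if k <= 0:
        raise ValueError("no decay detected")
    return math.log(2) / k


# Synthetic values only: a protein whose level halves every 2 hr.
TIMES = [0.0, 1.0, 2.0, 4.0]
LEVELS = [0.5 ** (t / 2.0) for t in TIMES]
```

Under these assumptions, a shorter estimated half-life in MEK-depleted cells than in control cells would correspond to the destabilization described above.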

In vitro, recombinant MEK1 proteins directly phosphorylated 
HSF1 at Ser326, and U0126 blocked this event (Figure 3H). 
ERK was reported to phosphorylate HSF1 at Ser307 (Chu 
et al., 1998), implying a direct ERK-HSF1 interaction. To exclude 
direct Ser326 phosphorylation by ERK, we performed in vitro 
HSF1 phosphorylation using immunoprecipitated endogenous 
ERK complexes that would comprise ERK associated with or 
without MEK. Although precipitated complexes phosphorylated 
Ser326, this event was blocked by U0126, but not by FR180204 
(Figure 3I). U0126, but not FR180204, blocked phosphorylation 
of recombinant ERK1 proteins by the same precipitates 
(Figure 3I). In contrast, FR180204, but not U0126, blocked 
phosphorylation of myelin basic protein (MBP), a known ERK 
substrate (Ahn et al., 1991), by the same precipitates 
(Figure 3I), showing ERK blockade by FR180204. These results 
strongly suggest that MEK, rather than ERK, directly phosphor- 
ylates Ser326. Moreover, recombinant ERK1 proteins impeded 
in vitro HSF1 Ser326 phosphorylation by recombinant MEK1 
proteins (Figure 3J), consistent with the suppression of Ser326 
phosphorylation by ERK in vivo. In contrast, ERK promotes 
HSF1 Ser307 phosphorylation. ERK depletion diminished 
Ser307 phosphorylation; however, this effect was largely abolished 
in MEK-deficient cells (Figure S3F), again indicating MEK 
dependence. MEK inhibition impaired Ser326 phosphorylation 
but enhanced Ser307 phosphorylation of HSF1 WT proteins (Fig- 
ure S3G). Interestingly, phosphomimetic mutant HSF1 S326D pro- 
teins displayed reduced basal Ser307 phosphorylation and 
resisted induction of this phosphorylation by MEK inhibition (Fig- 
ure S3G). These results support that Ser326 phosphorylation by 
MEK represses Ser307 phosphorylation, and ERK impacts HSF1 
Ser326 and Ser307 phosphorylation via MEK inhibition. Acti- 
vated ERK phosphorylates Thr292/386 to inhibit MEK1 (Brunet 
et al., 1994b). MEK1 T292A,T386A mutants both heightened basal 
HSF1 Ser326 phosphorylation and blocked ERK-mediated sup- 
pression of this phosphorylation in cells depleted of endogenous 
MEK (Figure 3K), indicating that ERK suppresses Ser326 phos- 
phorylation via feedback phosphorylation of MEK. 

Interestingly, two MEK1 mutations identified in human melanomas, 
P124S and E203K (Nikolaev et al., 2012), caused constitutive 
HSF1 phosphorylation and activation (Figures 3L and 3M). 
Conversely, in human melanoma cells, MEK inhibitors impaired 
constitutive HSF1 phosphorylation and binding to HSP promoters 
(Figures 3N and 3O). These results indicate that MEK 
controls both inducible HSF1 activation in stressed cells and 
constitutive HSF1 activation in malignant cells. 

MEK Preserves Cellular Proteostasis 

HSF1 could maintain cellular proteostasis via HSPs. To examine 
the impacts of HSF1 on protein folding, we employed the gluco- 
corticoid receptor (GR) as a model. Proper GR folding depends 
on HSP90 and misfolded proteins are cleared by the ubiquitin- 
proteasome system (Taipale et al., 2010). HSF1 knockdown 
induced GR-GFP ubiquitination and depletion (Figure S4A), indi- 
cating protein destabilization by HSF1 deficiency. This resulted 
from diminished cellular chaperoning capacity, as lysates of 
HSF1-deficient cells were less efficient in reactivating denatured 
luciferase (Figure S4B). Similarly to HSF1 deficiency, MEK 
blockade depleted GR-GFP, and this depletion is not due to 
GFP instability or general expression changes, since co-ex- 
pressed GFP was not affected (Figure 4A). Instead, MEK 
blockade ubiquitinated GR-GFP (Figure 4B). This is not due to 
impaired proteasomal function, as proteasome inhibition by Bor- 
tezomib caused GR-GFP accumulation and MEK inhibitors did 
not affect proteasomal activities (Figures S4C-S4E). In fact, 
AZD6244 and MEK knockdown both depleted chaperoning 
capacity (Figures 4C and S4F), revealing modulation of protein 
folding and stability by MEK. 

In line with a key role of HSF1 in governing cellular proteome, 
HSF1 depletion induced protein Lys48-specific ubiquitination, a 
modification marking proteins for proteasomal degradation 
(Pickart and Eddins, 2004), in both detergent-soluble and -insol- 
uble fractions (Figure S4G). This change suggests global protein 
destabilization. Consistent with HSF1 inactivation, AZD6244 
diminished Ser326 phosphorylation, reduced HSPs, and in- 
duced overall ubiquitination (Figure 4D). Overnight AZD6244 
treatment also destabilized HSF1 (Figures 4D and S4H). MEK 
knockdown induced global ubiquitination as well (Figure S4I). 
Importantly, AZD6244 failed to deplete HSF1 and provoke ubiq- 
uitination in cells stably overexpressing HSF1 S326D (Figures 4E 
and S4J), indicating a causative role of HSF1 inactivation in pro- 
tein instability due to MEK inhibition. In vivo MEK inhibition also 
depleted HSPs and HSF1 and provoked ubiquitination in primary 
tissues (Figure 4F). 

To investigate ubiquitomic changes due to MEK inhibition, we 
conducted mass spectrometry (MS)-based analyses of ubiquiti- 
nated peptides enriched by a novel ubiquitin branch motif 
antibody (Figure 4G) (Kim et al., 2011). We compared the ubiqui- 
tomes of A2058 cells treated with and without AZD6244 for 8 hr. 
In total, 3,425 non-redundant ubiquitinated peptides, assigned 
to 1,715 distinctive proteins, were profiled (Figures 4H and 
S4K; Table S1). AZD6244 both increased and decreased peptide 
ubiquitination (Figure 4H). When a 2.5-fold cutoff was defined 
as the significant change, a collection of 76 non-redundant 



(F) Endogenous MEK-HSF1 interactions were detected in HEK293T cells transfected with siRNAs. 

(G) Endogenous HSF1-MEK and HSF1-GFP-ERK1 interactions were detected in HEK293T cells transfected with indicated plasmids. 

(H) Schematic depiction of three possible scenarios. P, phosphorylation. 

(I) Immediately after HS, HSF1-ERK interactions were detected by colP. 

(J) Endogenous ERK-HSF1 interactions were detected in HEK293T cells stably expressing shRNAs. 

(K and L) Endogenous ERK-MEK and ERK-HSF1 interactions were detected by PLA in HeLa cells. Scale bars represent 50 μm for LM, 10 μm for HM. 
See also Figure S2. 

Figure 3. MEK Phosphorylates Ser326 to Activate HSF1 

(A and B) HSF1 Ser326 phosphorylation was measured by immunoblotting in HEK293T cells stably expressing shRNAs or transfected with MEK1 DD plasmid. 

(C) GFP or MEK1 DD plasmids were co-transfected with dual HSF1 reporter plasmids into HEK293T cells (mean ± SD, n = 5, Student’s t test). 

(D) Control or ERK-targeting siRNAs, A (siERK1_1 and siERK2_1) and B (siERK1_3 and siERK2_2), were transfected into HEK293T cells stably expressing 
shRNAs. 

(E) GFP or FLAG-HSF1 plasmids were co-transfected with dual HSF1 reporter plasmids into HEK293T cells stably expressing shRNAs (mean ± SD, n = 6, 
ANOVA). 




peptides were distinguished. These peptides represent 68 
unique proteins that perform diverse molecular functions and 
engage in a wide array of biological processes (Figures 4I and 
4J). Intriguingly, the most enriched pathway was translation 
elongation (Figure 4J). Analyses of these 68 proteins revealed 
a functional association network encompassing three sub- 
networks (Figure 4K). Of particular interest is the “translation” 
subnetwork that consists of 7 ribosomal subunit proteins, high- 
lighting a prominent impact of MEK on ribosome machinery. 
Moreover, embedded within the network are several oncogenes 
and tumor suppressors, including c-MYC, Cyclin D1, HIF1A, 
TP53, and NF1 (Figure 4K). Although the mechanisms whereby 
MEK regulates these key players in oncogenesis are likely multi- 
faceted, accumulating evidence has implicated HSPs in modu- 
lating their stabilities (Isaacs et al., 2002; Muller et al., 2004). 
Thus, MEK could impact these proteins at least in part via HSF1. 
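
The 2.5-fold cutoff used to select changed peptides can be sketched in a few lines of Python. This is an illustration only. The record type, field names, and example abundances are hypothetical, and the sketch assumes the cutoff is applied symmetrically to increases and decreases in the treated-to-control ratio; the source does not describe the software used for this step.

```python
from dataclasses import dataclass

FOLD_CUTOFF = 2.5  # cutoff stated in the text


@dataclass(frozen=True)
class PeptideQuant:
    """One quantified ubiquitinated peptide (field names are hypothetical)."""
    peptide: str    # peptide identifier
    protein: str    # protein the peptide is assigned to
    treated: float  # abundance with AZD6244, arbitrary units
    control: float  # abundance with vehicle, arbitrary units


def fold_change(p):
    """Treated-to-control abundance ratio."""
    return p.treated / p.control


def is_changed(p, cutoff=FOLD_CUTOFF):
    """True if the ratio rises or falls by at least `cutoff`-fold."""
    fc = fold_change(p)
    return fc >= cutoff or fc <= 1.0 / cutoff


def select_changed(peptides, cutoff=FOLD_CUTOFF):
    """Return the changed peptides and the set of proteins they map to."""
    hits = [p for p in peptides if is_changed(p, cutoff)]
    proteins = {p.protein for p in hits}
    return hits, proteins


# Made-up example values, not data from the study.
EXAMPLE = [
    PeptideQuant("pep1", "PROT_A", 61.0, 1.0),
    PeptideQuant("pep2", "PROT_A", 5.0, 1.0),
    PeptideQuant("pep3", "PROT_B", 1.0, 4.0),
    PeptideQuant("pep4", "PROT_C", 1.2, 1.0),
]
```

Because several peptides can map to the same protein, the protein set is smaller than the peptide list, which is consistent with 76 peptides representing 68 proteins.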

To validate our MS findings, we selected several target proteins. 
Torsin-1A interacting protein 2 (TOR1AIP2) and ribosomal 
protein L3 (RPL3) exhibited 61.0- and 13.7-fold increases, 
respectively, in ubiquitination (Table S1). To facilitate detection, 
we expressed V5-tagged TOR1AIP2 and RPL3 proteins via a 
constitutive promoter. AZD6244 treatment for 8 hr did not alter 
levels of both V5-tagged proteins but increased their ubiqui- 
tination (Figures 4L and 4M). Our MS results also revealed 
decreased ubiquitination of proteins including c-MYC, RPL15, 
RPL24, and RPS20 (Table S1). We confirmed reductions in 
both ubiquitination and total levels of endogenous c-MYC 
proteins (Figure 4N). Similar results were also observed for V5- 
tagged RPL15, RPL24, and RPS20 (Figures S4L-S4N), suggest- 
ing shortened protein half-life. Indeed, proteasome blockade by 
MG132 prevented RPL15-V5 depletion by AZD6244 and revealed 
its elevated ubiquitination (Figure 4O). Consistent with a 
critical role of HSF1 inactivation, both V5-tagged RPL15 and 
RPL3 proteins were highly ubiquitinated following HSF1 knock- 
down, and AZD6244 subtly affected this ubiquitination (Figures 
4O and 4P). Importantly, MG132 prevented depletions of endogenous 
RPL15 and RPL3 by AZD6244 and MEK knockdown 
(Figures 4Q and S4O), confirming destabilization of ribosomal 
proteins by MEK deficiency. While HSF1 knockdown diminished 
endogenous RPL15 and RPL3, HSF1 S326D expression elevated 
their basal levels and protected them from AZD6244-induced 
depletions (Figures 4R and S4P). These findings together indicate 
that MEK inhibition inactivates HSF1 to deplete cellular 
chaperoning capacity. In consequence, protein destabilization 
and ubiquitomic imbalance ensue. 

MEK Inhibition Provokes Aggregation and 
Amyloidogenesis in Malignant Cells 

Increased ubiquitination in detergent-insoluble fractions sug- 
gests protein aggregation (Figure 4D). To demonstrate this, 
we performed ubiquitin immunostaining. Melanoma cells were 
treated with Bortezomib and AZD6244 for 24 hr to provoke 
evident aggregation. As expected, bright fluorescent punctate 
foci emerged in Bortezomib-treated cells (Figure 5A), demar- 
cating ubiquitin-containing aggregates. AZD6244 depleted 
HSF1 and, albeit to a lesser extent, induced punctate foci 
(Figures 5A and S5A). We theorized that blockade of protea- 
somal degradation of AZD6244-induced misfolded proteins 
would aggravate aggregation. Indeed, Bortezomib co-treatment 
augmented AZD6244-induced punctate foci (Figure 5A). We 
further confirmed the impact of MEK on aggregation using an 
expanded polyglutamine tract protein (polyQ79) (Sanchez 
et al., 2003). Proteins with expanded polyQ fragments are 
aggregation-prone and causally related to neurodegenerative 
disorders (Orr and Zoghbi, 2007). PolyQ79-expressing cells, as 
expected, contained large aggregates (Figure S5B). HSF1 
knockdown and AZD6244 both enlarged polyQ aggregates; 
importantly, HSF1 S326D expression antagonized AZD6244- 
induced aggregate enlargement (Figures 5B and S5C). Bortezo- 
mib also enhanced aggregation, and combined treatment 
produced the largest aggregates (Figure 5B). Thus, both MEK 
and the proteasome suppress protein aggregation. 

Aggregation-prone proteins can form amyloid fibrils (AFs) 
enriched for β-sheet structures (Eisenberg and Jucker, 2012). 
To assess whether HSF1 and MEK impact amyloid formation, 
we stained polyQ79-expressing cells with Thioflavin T (ThT) 
and Congo red (CR), two fluorescent dyes widely used to diag- 
nose amyloids (Chiti and Dobson, 2006). PolyQ79 expression 
enhanced ThT and CR staining (Figures S5D and S5E), as ex- 
pected. AZD6244, Bortezomib, and combined treatment further 
intensified this staining, and HSF1 S326D expression antagonized 
the effect of AZD6244 (Figure 5C). Treatments also enhanced 
ThT and CR staining of human tumor cell lines (Figures 5D 
and S5F), suggesting emergence of endogenous amyloid-like 
structures. The presence of soluble amyloid oligomers (AOs) 
confirmed amyloidogenesis. AOs are believed to constitute a 



(F) FLAG-HSF1 plasmids were transfected into HEK293T cells stably expressing HSF1-targeting shRNAs. HSF1-DNA binding was measured after HS as 
described in Figure 1E using anti-FLAG antibodies. The results were normalized against nuclear FLAG-HSF1 levels (mean ± SD, n = 3, Student’s t test). 

(G) FLAG-HSF1 proteins were detected in HEK293T cells treated with 20 μg/ml cycloheximide (CHX). Co-expressed GFP proteins served as internal controls. 

(H) Purified GST-MEK1 proteins (100 ng) were incubated with U0126 at RT for 20 min followed by incubation with 400 ng purified His-HSF1 proteins at RT for 
30 min. HSF1 phosphorylation was detected by immunoblotting. 

(I) ERK complexes precipitated from HEK293T cells were treated with U0126 or FR180204, followed by incubation with 400 ng His-HSF1, 400 ng GST-ERK1, or 
1,000 ng MBP proteins. 

(J) Inactive GST-ERK1 proteins were incubated with 100 ng GST-MEK1 and 400 ng His-HSF1 proteins at RT for 30 min. 

(K) LacZ or GFP-ERK1 plasmid was co-transfected with MEK1 WT or MEK1 T292A,T386A plasmid into HEK293T cells stably expressing MEK-targeting shRNAs. 

(L) HSF1 Ser326 phosphorylation was detected in HEK293T cells transfected with indicated plasmids. 

(M) HSF1 activities were measured by the dual reporter system in HEK293T cells transfected with indicated plasmids (mean ± SD, n = 6, ANOVA). 

(N) WM115 cells were treated with 20 nM AZD6244 or 20 μM U0126 overnight. 

(O) HSF1 chromatin immunoprecipitation (ChIP) assays were performed using WM115 cells treated with DMSO or 20 nM AZD6244 overnight. The results were 
normalized against the values of IgG controls (mean ± SD, n = 3, ANOVA). 

See also Figure S3. 

Figure 4. MEK Preserves Proteostasis 

(A) GFP and GR-GFP plasmids were cotrans- 
fected into HEK293T cells followed by treatments 
with 20 nM AZD6244, 20 μM U0126, or 200 nM 
17-DMAG for 4 hr. 

(B) GR-GFP plasmids were co-transfected into 
HEK293T cells with HA-Ub-K48 plasmids, which 
encode a mutant ubiquitin that can be conjugated 
to protein substrates only via lysine 48. Following 
treatments with 20 nM AZD6244 or 200 nM 
17-DMAG for 4 hr, GR-GFP proteins were 
precipitated and ubiquitination was detected us- 
ing anti-HA antibodies. 

(C) Denatured firefly luciferases were incubated 
with lysates of A2058 cells treated with DMSO or 
20 nM AZD6244 (mean ± SD, n = 4, ANOVA). 

(D) A2058 cells were treated with 20 nM AZD6244, 
and ubiquitinated proteins were detected in both 
detergent-soluble and detergent-insoluble frac- 
tions using Lys48-specific ubiquitin antibodies. 

(E) A2058 cells stably expressing LacZ or HSF1 S326D 
were treated with 20 nM AZD6244 for 8 hr. 

(F) C57BL/6J mice were i.p. injected with DMSO 
or AZD6244 three times a week for 2 weeks. S, 
spleen; K, kidney; L, liver. 

(G) Experimental procedures of MS-based quan- 
titation of ubiquitinated peptides, two technical 
replicates per treatment. 

(H) Scatter plot of relative changes in peptide 
abundance between treated and control conditions. 
The green and red lines indicate 2.5-fold cutoffs. 

(I) The classification of the 68 proteins was per- 
formed using the PANTHER gene list analysis tool 
(http://www.pantherdb.org). 

(J) The Gene Ontology (GO) biological process 
enrichment analysis was performed using the 
web-based Enrichr software application. 

(K) Interaction network of the 68 proteins. Known 
and predicted protein interactions were derived 
from the STRING database (http://www.string-db. 
org), and the network was visualized using Cyto- 
scape software. 

(L and M) TOR1AIP2-V5 or RPL3-V5 plasmids 
were co-transfected with HA-Ub-K48 plasmids 
into HEK293T cells. Following 20 nM AZD6244 
treatment for 8 hr, proteins were precipitated with 
anti-V5 antibodies. 

(N) Following AZD6244 treatment, endogenous c- 
MYC proteins were precipitated from A2058 cells 
and immunoblotted with anti-ubiquitin antibodies. 
(O and P) RPL15-V5 and RPL3-V5 plasmids were 
co-transfected with HA-Ub-K48 plasmids into 
HEK293T cells stably expressing shRNAs. Cells 
were treated with 500 nM MG132 alone or co- 
treated with 20 nM AZD6244 for 8 hr. 

(Q) Endogenous RPL15 and RPL3 proteins were 
detected in A2058 cells treated with 20 nM 
AZD6244 alone or co-treated with 500 nM MG132. 

(R) Endogenous RPL15 and RPL3 proteins were 
detected in A2058 cells stably expressing LacZ or 
HSF1 S326D with AZD6244 treatment. 

See also Figure S4 and Table S1. 



key toxic species in neurodegenerative disorders and can be 
detected by the conformation-dependent antibody A11 (Chiti 

and Dobson, 2006; Glabe, 2008; Kayed et al., 2003). Treatments 



not only exaggerated AO induction by polyQ79, but also pro- 
voked genesis of endogenous AOs in human tumor cell lines 
(Figures 5E and 5F). Similarly to AZD6244, HSF1 depletion 
induced endogenous AOs and AFs (Figures S5G and S5H). A 
previously characterized antibody, OC, was used to detect AFs 
(Kayed et al., 2007). Again, HSF1 S326D expression suppressed 
AZD6244-induced amyloidogenesis (Figures 5G and 5H). 

A unique feature of amyloids is their ability to seed AFs (Chiti 
and Dobson, 2006). In amyloid seeding experiments, lysates 
of HSF1-depleted cells accelerated formation of Aβ AFs 
(Figure S5I). Similarly, lysates of cells treated with AZD6244, Bortezomib, 
or the combination all exhibited augmented seeding 
efficacy (Figure 5I), which was confirmed using OC antibodies 
(Figure S5J). HSF1 S326D expression abolished the effect of 
AZD6244 (Figure 5I). Furthermore, transmission electron microscopy 
revealed that while seeding with DMSO-treated cell 
lysates resulted in disordered rod-like protofibrils, a dense lattice 
of fiber-like structures emerged following seeding with 
AZD6244-treated lysates; in comparison, seeding with Bortezomib-treated 
lysates produced mature fibrils, and a compacted 
network of fibrils was assembled after seeding with lysates 
from combined treatment (Figure 5J). 

The amyloidogenic effects of AZD6244 and Bortezomib were 
validated genetically. Depletion of the β5 subunit (PSMB5) of 
the 26S proteasome, a primary target of Bortezomib (Oerlemans 
et al., 2008), caused accumulation of ubiquitinated proteins (Fig- 
ure S5K). Mimicking pharmacological inhibitors, genetic deple- 
tions of MEK, PSMB5, or both all provoked amyloidogenesis 
(Figures 5K and S5L-S5Q). 

To determine whether amyloids contribute to inhibitor- 
induced toxicities, we blocked amyloidogenesis with ThT, which 
impedes amyloid fibrillization via physical binding (Alavez et al., 
2011). In melanoma cells, ThT suppressed amyloid induction 
by inhibitors and improved cellular growth and survival by 50% 
(Figures 5L-5N). CR treatment and neutralization of AOs with 
A11 antibodies exerted similar protection (Figures 5O and 
S5R). Congruent with mitigated amyloidogenesis, HSF1 S326D 
expression not only stimulated the growth of melanoma cells 
but also rendered them refractory to MEK inhibition (Figure 5P). 

Surprisingly, AZD6244 did not induce AOs in primary mouse 
embryonic fibroblasts (MEFs) and tissues (Figures 5Q and S5S). 
This is not due to inability to detect murine amyloids, as severe 
stress did induce AOs in murine cells (Figure S5T). These results 
suggest that non-transformed cells may be more refractory to 
amyloidogenesis than malignant cells. To assess this, we 
compared AO levels in primary human mammary epithelial cells 
(PHMC), immortalized human mammary epithelial (MCF10A) cells, 
and tumorigenic mammary epithelial (MCF7) cells treated with 
AZD6244, Bortezomib, and the combination. Each of these three 
treatments caused marked AO induction in MCF7 cells, slight in- 
duction in MCF10A cells, and no induction in PHMC (Figure 5R). 
A similar pattern was observed in primary human Schwann cells 
(PHSC) and their malignant counterparts, 90-8TL and S462 cells 
(Figure 5S). Immortalized and transformed cells, except S462, 
also showed elevated basal levels of AOs (Figures 5R and 5S). 
Intriguingly, AO levels positively correlated with malignant states 
(Figure 5T), supporting proteomic imbalance as an intrinsic feature 
of malignancy. We theorized that the lack of elevated basal AOs in 
S462 cells might be due to amyloid-associated toxicity. Indeed, 
blockade of cell death by a pan-caspase inhibitor (CI) elevated 
AO levels in immortalized and transformed cells, revealing heightened 
amyloidogenesis; in contrast, it did not elevate AOs in primary 
cells (Figures 5T and 5U), supporting an absence of amyloidogenesis. 
These results indicate that malignant cells are 
distinctively susceptible to amyloidogenesis. 

Combined Proteasome and MEK Inhibition Disrupts 
Tumor Proteostasis and Suppresses Malignancy 

MEK and proteasome inhibition, individually, disturbed proteostasis 
in tumor cells to a certain degree; however, the combination 
of both augmented this effect and, accordingly, markedly 
impaired the growth and survival of human tumor cell lines (Figure 6A). 
Of note, this combination did not impact primary cells 
(Figure 6A). 

In vivo, whereas low doses of AZD6244 or Bortezomib alone 
exhibited no significant impacts on xenografted melanomas, 
the combination potently retarded their growth (Figures 6B and 
S6A). All mice receiving the combined treatment remained alive 
and their body weights remained constant; in contrast, all mice in 
the other groups died and lost ~25% of body weight (Figures 6C 
and S6B). AZD6244 or Bortezomib alone slightly elevated ubiquitination 
in tumors; however, the combination markedly aggravated 
this effect (Figures 6D and S6C). While Bortezomib 
induced HSF1 Ser326 phosphorylation and HSP expression, 
AZD6244 co-treatment suppressed this stress response and 
induced caspase 3 cleavage (Figures 6D and S6C). Accordingly, 
AOs were evidently elevated in tumors receiving combined treat- 
ment (Figure 6E). Of particular interest is an inverse correlation 
between amounts of AOs and tumor masses (Figure S6D), sup- 
porting an adverse impact of AOs on malignant growth. 
Congruent with amyloidogenesis, tumors receiving combined 
treatment displayed potent seeding capacities and enhance- 
ment of CR staining (Figures 6F and 6G). Intratumoral AFs 
were further demonstrated by the hallmark birefringence of CR 
staining (Figure 6H), ThT labeling (Figure S6E), and immunostaining 
with OC antibodies (Figure 6I). In contrast, the combined 
treatment did not induce AOs or apoptosis in primary tissues 
of the same tumor-bearing mice, despite elevated ubiquitination 
(Figures S6F and S6G). 

To investigate whether the combined treatment impedes 
experimental metastasis, we intravenously injected melanoma 
cells expressing a luciferase transgene into NOD/SCID mice. 
During a 6-week period, only mice receiving combined treatment 
gained body weight (Figure 6J), suggesting improved health. 
In vivo imaging detected luminescence in 40% of mice treated 
with DMSO, AZD6244, or Bortezomib alone (Figure 6K). Histo- 
logical examination confirmed melanoma metastases to the 
lung, skeletal muscle, adipose tissue, and ovary (Figure 6L; Table 
S2). In contrast, none of the mice receiving combined treatment 
displayed discernible luminescence or metastases (Figures 6K 
and 6M). Together, these results demonstrate that combined 
MEK and proteasome inhibition provokes proteotoxic stress 
and amyloidogenesis within tumors and exerts robust anti- 
neoplastic effects. 

Amyloidogenesis Is Tumor Suppressive 

Evident apoptosis in tumor regions showing intense CR 
staining suggests a causative role of amyloidogenesis in treat- 
ment-induced toxicity (Figure 7A). To confirm this, we blocked 
intratumoral amyloid induction via in vivo CR administration. CR 
not only accelerated melanoma growth but also potently antag- 
onized the tumor suppression imposed by combined MEK and 
proteasome inhibition (Figure 7B). Penetration of CR into tumor 
tissues was indicated by intense light absorption of tumor ly- 
sates at 498 nm (Figure 7C), a characteristic of this amyloid stain 
(Sladewski et al., 2006). Congruent with enhanced malignancy, 
CR treatment enlarged tumor masses, deteriorated body condi- 
tions, and shortened animal survival (Figures 7D, S7A, and S7B). 

While CR reduced amyloids in tumor tissues, it did not 
diminish ubiquitination (Figures 7E-7G). These results indicate 
no interference of CR with MEK and proteasome inhibitors and 
further support a specific action of CR in blocking amyloid gen- 
esis. In accordance with accelerated growth, CR-treated tumors 
displayed reduced caspase 3 cleavage (Figure 7G). Collectively, 
these results strongly suggest that amyloidogenesis is tumor 
suppressive and evidently contributes to the anti-neoplastic ef- 
fects of combined MEK and proteasome inhibition. 

DISCUSSION 

HSF1 Is a New MEK Substrate 

Unexpectedly, our results reveal HSF1 as a physiological sub- 
strate for MEK, challenging the prevailing paradigm wherein 
ERK exclusively instigates the effects of RAS-RAF-MEK 
signaling. Our results further show that MEK activates but ERK 
inactivates HSF1. Importantly, our findings integrate these two 
seemingly contradictory actions and support the assembly of a 
ternary ERK-MEK-HSF1 protein complex. In aggregate, our find- 
ings propose a bifurcated, rather than a linear, RAS-RAF-MEK 
cascade. MEK, as a central nexus, both conveys upstream stim- 
uli and governs two discrete but interconnected downstream 
effector pathways, of which one is mediated by ERK and the 
other by HSF1 (Figure 7H). In a negative feedback fashion, 



ERK finely attunes HSF1 activation via inhibitory phosphorylation 
of MEK (Figure 7H). While our studies focused on MEK-mediated 
Ser326 phosphorylation, other kinases can also regulate HSF1. 

Guarding of Proteostasis by RAS-RAF-MEK Signaling 

Our findings uncover a new function of RAS-RAF-MEK sig- 
naling in regulating proteostasis. Diverse proteotoxic stressors 
commonly activate MEK (Figure 1A). Through HSF1 activation, 
RAS-RAF-MEK signaling heightens cellular chaperoning capac- 
ity to guard proteomic integrity. 

MEK-HSF1 regulation could have key physiological implica- 
tions. Mitogens stimulate RAS/MAPK signaling and downstream 
mTORC1 (Laplante and Sabatini, 2012). However, heightened 
protein synthesis driven by mTORC1 encumbers cellular protein 
quality-control machinery. It thus appears necessary for mitogens, 
via MEK, to concurrently mobilize the HSF1-controlled 
chaperone system to ensure productive protein synthesis and, 
thereby, avert proteomic imbalance. Interestingly, MEK also 
governs translation capacity via HSF1 (Figure 4K). Thus, RAS- 
RAF-MEK signaling synchronizes protein quantity- and quality- 
control machineries to support cellular growth. 

It is also tempting to speculate that RAS-RAF-MEK signaling 
may antagonize protein-misfolding diseases, such as amyloid- 
osis, via guarding proteostasis. 

Proteomic Instability of Cancer 

Our findings pinpoint the pro-amyloidogenic nature of the malignant 
state. The susceptibility of malignant cells to amyloidogenesis 
likely originates from their debilitated proteostatic state, which 
is particularly vulnerable to perturbations. Unlike non-trans- 
formed cells, malignant cells constantly endure proteomic imbal- 
ance, evidenced by elevated basal levels of amyloids (Figures 5T 
and 5U). Accordingly, HSF1, otherwise latent in primary cells, 
is constitutively mobilized in tumor cells to retain the fragile 



Figure 5. MEK and Proteasome Inhibition Provoke Protein Aggregation and Amyloidogenesis 

(A) WM115 cells treated with 20 nM AZD6244, 100 nM Bortezomib, or both for 24 hr were stained with Lys48-specific ubiquitin antibodies. Arrowheads mark 
ubiquitin-positive aggregates. Scale bar represents 10 μm. Amounts of aggregates per cell were quantitated using ImageJ (median, n > 100, ANOVA). 

(B and C) Following transfection with polyQ79 plasmids alone or with both polyQ79 and HSF1 S326D plasmids for one day, HEK293T cells were treated with 
inhibitors as described in (A). Cells were either analyzed for aggregate size or stained with 10 μM ThT. 

(D) Treated tumor cell lines were stained with 10 μM ThT. Geometric means were used to calculate fold changes in ThT fluorescence intensity and the log2(FC) 
values were presented as a heat map. 

(E) HEK293T cells were transfected with LacZ or polyQ79 plasmid. Following treatments, AOs were quantitated by ELISA using A11 antibodies (mean ± SD, n = 3, 
ANOVA). 

(F) Intrinsic AOs were detected in human tumor cell lines (mean ± SD, n = 3, ANOVA). 

(G and H) A2058 cells stably expressing LacZ or HSF1 S326D were treated for 24 hr. Amyloids were quantitated by ELISA (mean ± SD, n = 3, Student’s t test). 

(I) Synthetic Aβ1-42 peptides (20 μM) were incubated at RT with gentle shaking with 20 μg lysates of A2058 cells treated with inhibitors. AF formation was 
monitored by ThT binding (mean ± SD, n = 3, ANOVA). 

(J) For TEM studies (left panel, 80,000×; right panel, 200,000×), 20 μM synthetic Aβ1-42 peptides were incubated with A2058 cell lysates in PBS at 37°C with 
gentle shaking for 2 days. Scale bars represent 100 nm. 

(K) HEK293T cells stably expressing different shRNAs were stained with 10 μM ThT. 

(L and M) After pre-incubation with 10 μM ThT for 6 hr, A2058 cells were treated for 24 hr. Amyloids were quantitated (mean ± SD, n = 3, Student’s t test). 

(N and O) After pre-incubation with 10 μM ThT for 6 hr or transfection with 100 ng A11 antibodies using JBS-Proteoducin for 16 hr, A2058 cells were treated for 
24 hr. Viable cells were quantitated using CellTiter-Blue reagents (mean ± SD, n = 6, Student’s t test). 

(P) A2058 cells stably expressing LacZ or HSF1 S326D were treated with DMSO or 20 nM AZD6244. Viable cells were quantitated (mean ± SD, n = 6, ANOVA). 
Relative changes in viable cells after treatment were calculated by normalizing the values of AZD6244-treated cells against the values of DMSO-treated cells at 
each time point. 

(Q-S) Following treatments with 20 nM AZD6244, 100 nM Bortezomib, or both for 24 hr, AOs were quantitated in primary MEFs and human cells (mean ± SD, n = 3, 
ANOVA). 

(T and U) Cells were treated with 50 μM Q-VD-OPh overnight and AOs were quantitated (mean ± SD, n = 3, Student’s t test). 

See also Figure S5. 
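
The log2 fold-change calculation described in (D) can be sketched as follows. This is a minimal illustration in Python, assuming each condition is summarized by the geometric mean of its ThT fluorescence readings; the function name and the example readings are hypothetical and are not data from the study.

```python
import math
from statistics import geometric_mean


def log2_fold_change(treated, control):
    """Return log2 of the ratio of geometric means (treated over control)."""
    return math.log2(geometric_mean(treated) / geometric_mean(control))


# Hypothetical ThT fluorescence readings (arbitrary units), for illustration only.
control = [100.0, 100.0]
treated = [200.0, 800.0]
fc = log2_fold_change(treated, control)
print(round(fc, 3))
```

Positive values indicate increased ThT fluorescence relative to control, negative values a decrease, which is the convention a heat map of log2(FC) values would typically follow.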
proteomic equilibrium (Figure 7H). Proteomic chaos inevitably 
ensues following either HSF1 inactivation or even mild proteotoxic 
insults. Hence, tumor cells rely on HSF1 to sustain 
their malignant phenotypes (Dai et al., 2007). In contrast, 
the lack of intrinsic proteotoxic stress empowers primary 
cells to effectively buffer intense proteomic fluctuations and 
thereby avert deleterious consequences: aggregation and 
amyloidogenesis. 

Excitingly, the distinct susceptibilities to proteomic perturba- 
tion between primary and malignant cells may be exploited to 
combat malignancy. Our findings support important roles for 
proteotoxic stress and amyloidogenesis in the toxicity of MEK 
inhibition in malignancy. Through protein destabilization, MEK 
inhibitors act as a proteotoxic stressor, mechanistically distinct 
from proteasome inhibitors. When applied as a single agent, 
a MEK or proteasome inhibitor is insufficient to disrupt 
tumor proteostasis. However, combinatorial application exerts 
a profound impact, eliciting amyloidogenesis. Importantly, our 
findings strongly suggest a tumor-suppressive nature of amyloidogenesis 
(Figures 5L-5O and 7B). These findings imply that 
amyloidogenesis, indicative of grave proteomic imbalance, 
may be of prognostic value in monitoring tumor progression 
and evaluating therapeutic responses. Conceptually, our findings 
suggest that proteomic instability is an intrinsic characteristic 
associated with the malignant state and that, therefore, disruption 
of fragile tumor proteostasis may be a feasible therapeutic 
strategy. 

EXPERIMENTAL PROCEDURES 
Proximity Ligation Assay 

Cells were fixed with 4% formaldehyde in PBS for 15 min at room temperature 
(RT). After blocking with 5% goat serum in PBS with 0.3% Triton X-100, cells 
were incubated with a pair of rabbit and mouse primary antibodies 1:200 
diluted in the blocking buffer overnight at 4°C. Following incubation with Duo- 
link PLA anti-rabbit Plus and anti-mouse Minus probes (OLINK Bioscience) at 
37°C for 1 hr, ligation, rolling circle amplification, and detection were per- 
formed using Duolink In Situ Detection Reagents Red (OLINK Bioscience). 
Nuclei were stained with Hoechst 33342. Signals were visualized using a Leica 
TCS SP5 confocal microscope. 



CR and ThT Staining of Tumor Sections 

Following deparaffinization and rehydration, tumor sections were stained with 
0.5% CR in PBS at RT for 20 min followed by differentiation in alkaline solutions 
(0.01% NaOH, 50% alcohol). Nuclei were stained with either Hoechst 33342 or 
hematoxylin. Fluorescence was visualized using a Leica TCS SP5 confocal mi- 
croscope and the birefringence visualized using a Leica DM5000B upright mi- 
croscope equipped with polarized light filters. For ThT staining, sections were 
stained with 0.2% ThT in PBS at RT for 10 min, rinsed in 1% acetic acid for 
2 min, and washed with ddH2O three times. Nuclei were stained with SYTO 
62 (Life Technologies). 

Melanoma Xenograft Models 

A2058 cells were subcutaneously (s.c.) injected into the left flanks of 9-week- 
old female NOD.CB17-Prkdc <scid>/J (NOD/SCID) mice (The Jackson Labo- 
ratory). For CR treatment, mice were intraperitoneally (i.p.) injected with PBS or 
CR 1 day prior to combined AZD6244 and Bortezomib treatments. Tumor volumes 
were calculated following the formula $\frac{4}{3}\pi R^{3}$. For experimental metastasis, 
engineered A2058 cells were transplanted into 10-week-old female 
NOD/SCID mice via tail vein injections. All mouse experiments were performed 
under a protocol approved by The Jackson Laboratory Animal Care and Use 
Committee. 
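
The tumor volume formula above treats each tumor as a sphere of radius R. A minimal sketch in Python follows, assuming R is taken as half of a caliper-measured diameter; the source does not state how R was obtained, and the function name and example value are illustrative only.

```python
import math


def tumor_volume(radius_mm: float) -> float:
    """Volume of a sphere, (4/3) * pi * R^3, in mm^3 for a radius given in mm."""
    if radius_mm < 0:
        raise ValueError("radius must be non-negative")
    return 4.0 / 3.0 * math.pi * radius_mm ** 3


# Hypothetical example: a tumor with a 10 mm caliper diameter, so R = 5 mm.
volume = tumor_volume(10.0 / 2.0)
print(f"{volume:.1f} mm^3")
```

Because volume scales with the cube of the radius, a small caliper error is amplified roughly threefold in relative terms, which is worth keeping in mind when comparing volumes across groups.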

Statistical Methods 

All statistical analyses were performed using Prism 6.0 (GraphPad software). 
Statistical significance: *p < 0.05; **p < 0.01; ***p < 0.001. 

See also the Extended Experimental Procedures. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, seven 
figures, and two tables and can be found with this article online at 
http://dx.doi.org/10.1016/j.cell.2015.01.028. 
Figure 6. Combined MEK and Proteasome Inhibition Exerts Potent Tumor-Suppressive Effects 

(A) After treatments with 20 nM AZD6244, 100 nM Bortezomib, or both for 24 hr, viable cells were quantitated using CellTiter-Blue reagents (mean ± SD, n = 6, ANOVA). 
(B and C) A2058 cells (1 × 10⁶) were s.c. injected into NOD/SCID mice. After 7 days, mice were treated with DMSO, 5 mg/kg AZD6244, 0.5 mg/kg Bortezomib, or 
the combination via i.p. injection three times a week. Tumor volumes were measured using a caliper weekly (mean ± SEM, ANOVA). Tumor growth curves were 
fitted to exponential growth models to derive tumor-doubling time (DT). Kaplan-Meier survival curve was plotted for each group (Log-rank test). 

(D) Proteins were detected by immunoblotting, three tumors per group. 

(E) Tumor lysates were used to quantitate AOs, five tumors per group (mean ± SD, n = 3, ANOVA). 

(F) Tumor lysates were used to seed Aβ1-42 peptides, five tumors per group (mean ± SD, n = 3, ANOVA). 

(G) Tumor sections were stained with CR, five tumors per group. Ten random fields were taken for each section. Scale bar represents 50 μm. Total CR fluorescence 
in each field was quantitated using ImageJ and normalized against total nuclei (median, n = 50, ANOVA). 

(H) Following CR staining, tumor sections were visualized under polarized light microscopy. Scale bar represents 50 μm. 

(I) Following staining with AF-specific antibodies (OC), sections of tumors receiving combined treatment were further stained with CR. Scale bar represents 
50 μm. 

(J) A2058 cells (1 × 10⁶) stably expressing firefly luciferase transgene were intravenously (i.v.) injected into NOD/SCID mice. Treatments were initiated 1 day after 
as described in (B) for 6 weeks. Body weights were monitored weekly (mean ± SD, n = 10, ANOVA). 

(K) Detection of metastases by in vivo bioluminescence imaging. 

(L) Representative micrographs illustrate metastatic melanomas in the lung, skeletal muscle, pelvic adipose tissue, and ovary. T, tumors; L, lung; M, muscle; B, 
bone; A, adipose tissue; OF, ovarian follicle. Scale bar represents 500 μm. 

(M) Combined MEK and proteasome inhibition prevents experimental melanoma metastasis (Barnard’s exact test). 

See also Figure S6 and Table S2. 
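
The tumor-doubling time (DT) mentioned in (B and C) follows from an exponential growth model, V(t) = V0 · exp(k·t), for which DT = ln(2)/k. The sketch below is a simplified illustration in Python that estimates k by a least-squares line through ln(V) versus time; this log-linear fit is an assumption made for brevity and is not necessarily the fitting procedure used in Prism. The function name and the synthetic data are hypothetical.

```python
import math


def doubling_time(days, volumes):
    """Fit ln(V) = ln(V0) + k*t by least squares and return ln(2)/k."""
    if len(days) != len(volumes) or len(days) < 2:
        raise ValueError("need at least two paired measurements")
    logs = [math.log(v) for v in volumes]
    n = len(days)
    mean_t = sum(days) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in days)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(days, logs))
    k = sxy / sxx  # growth rate constant per day
    if k <= 0:
        raise ValueError("volumes are not growing; doubling time is undefined")
    return math.log(2) / k


# Synthetic data: a tumor constructed to double every 7 days (not study data).
days = [0, 7, 14, 21]
volumes = [50.0 * 2 ** (t / 7) for t in days]
dt = doubling_time(days, volumes)
print(f"doubling time: {dt:.2f} days")
```

A longer DT in a treated group corresponds to slower growth, so comparing DT values across groups summarizes each growth curve with a single number.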
Figure 7. Amyloidogenesis Suppresses Tumor Growth 

(A) Sections of melanomas receiving combined treatment were stained with cleaved caspase 3 antibodies followed by CR staining. Arrowheads and arrows 
indicate condensed and fragmented nuclei, respectively. Scale bar represents 50 μm. 

(B) A2058 cells (1 × 10⁶) were s.c. injected into NOD/SCID mice. After 7 days, mice were treated with 1 mg/30 g CR via i.p. injection 1 day prior to combined 
treatment. Tumor volumes were measured weekly (mean ± SD, ANOVA). 

(C) Lysates of CR-treated tumors exhibited absorbance at 498 nm, three tumors per group (mean ± SD, n = 3). Lysis buffer containing CR served as a positive 
control. 

(D) Kaplan-Meier survival curves were compared (Log-rank test). 

(E and F) Both detergent-soluble and detergent-insoluble fractions of tumor lysates were used to quantitate amyloids, three tumors per group (mean ± SD, n = 3, 
Student’s t test). 



(G) Proteins were detected by immunoblotting, three tumors per group. 

(H) Schematic depiction of the interplay among MEK, ERK, and HSF1, and its role in regulating proteome stability. Balanced proteostasis suppresses toxic protein 
aggregation and amyloidogenesis, thereby facilitating tumorigenesis. 

See also Figure S7. 
SUMMARY 

Impaired insulin-mediated suppression of hepatic 
glucose production (HGP) plays a major role in the 
pathogenesis of type 2 diabetes (T2D), yet the molec- 
ular mechanism by which this occurs remains 
unknown. Using a novel in vivo metabolomics 
approach, we show that the major mechanism by 
which insulin suppresses HGP is through reductions 
in hepatic acetyl CoA by suppression of lipolysis in 
white adipose tissue (WAT) leading to reductions in 
pyruvate carboxylase flux. This mechanism was 
confirmed in mice and rats with genetic ablation of in- 
sulin signaling and mice lacking adipose triglyceride 
lipase. Insulin’s ability to suppress hepatic acetyl 
CoA, PC activity, and lipolysis was lost in high-fat- 
fed rats, a phenomenon reversible by IL-6 neutraliza- 
tion and inducible by IL-6 infusion. Taken together, 
these data identify WAT-derived hepatic acetyl CoA 
as the main regulator of HGP by insulin and link it 
to inflammation-induced hepatic insulin resistance 
associated with obesity and T2D. 

INTRODUCTION 

The Centers for Disease Control predict that by the year 2050, 
one in three Americans will suffer from type 2 diabetes (T2D) 
(Boyle et al., 2010); therefore, efforts to understand and intervene 
in the pathogenesis of the disease are urgently needed. Fasting 
hyperglycemia, due to increased rates of hepatic gluconeogen- 
esis, and the inability of insulin to suppress this process are the 
hallmark of T2D, but the molecular mechanism for this alteration 
is unknown (Kahn et al., 2014; Olefsky and Glass, 2010; Robbins 




et al., 2014; Samuel and Shulman, 2012). In this regard, mice 
lacking Akt1, Akt2, and Foxo1 (TKO) (Lu et al., 2012), the 
canonical insulin signaling proteins thought to regulate hepatic 
glucose production (HGP), surprisingly do not show any defect 
in insulin-mediated suppression of HGP. 

We hypothesized that insulin’s ability to suppress lipolysis in 
white adipose tissue (WAT) may be critical for the suppression 
of HGP by reducing fatty acid flux to the liver resulting in 
decreased hepatic acetyl CoA concentrations and decreased 
pyruvate carboxylase (PC) activity, the first step in the conver- 
sion of pyruvate to glucose. While hepatic acetyl CoA has been 
previously shown to modulate PC activity in vitro (Barritt et al., 
1966; Cazzulo and Stoppani, 1968; Cooper and Benedict, 
1966; Keech and Utter, 1963; Krebs et al., 1965), methodological 
limitations stemming from acetyl CoA’s low concentrations and 
rapid degradation ex vivo have prevented its measurement 
in vivo. 

We further hypothesized that increased hepatic acetyl CoA, 
due to increased rates of WAT lipolysis, is responsible for 
increased fasting HGP and reduced insulin-mediated suppres- 
sion of HGP in high-fat-fed rodent models of diet-induced 
obesity associated with hepatic insulin resistance and hypergly- 
cemia. In this regard, inflammatory cytokines are well-estab- 
lished mediators of lipolysis (Fonseca-Alaniz et al., 2007; Savage 
et al., 2005; Suganami and Ogawa, 2010), and protection from 
inflammation-associated insulin resistance could explain the 
preserved insulin sensitivity of macrophage-specific c-JUN 
N-terminal kinase (mφJNK) knockout mice fed a high-fat diet 
(Han et al., 2013). 

To address these questions, we developed a novel liquid chro- 
matography/tandem mass spectrometry (LC-MS/MS) technique 
to measure hepatic acetyl CoA content from in situ freeze- 
clamped liver within 10 s of intravenous euthanasia in awake, 
unrestrained rats. These measurements were combined with 
measurements of whole-body rates of lipolysis, assessed by 
rates of [1,1,2,3,3-²H₅] glycerol and [U-¹³C] palmitate turnover, 
combined with rates of glucose turnover assessed by [3-³H] 
glucose before and during a hyperinsulinemic-euglycemic 
clamp. Finally, we also performed additional studies to quantify 
in vivo pyruvate carboxylase flux and relative contributions of 
oxaloacetate and glycerol to hepatic gluconeogenesis using a 
combined NMR-LC-MS/MS positional isotopomer approach 
with [3-¹³C] lactate as a tracer. Using this comprehensive in vivo 
metabolomics approach, we show that hepatic acetyl CoA is 
the key regulator of hepatic insulin action in normal fasted, 
free-ranging rodents in vivo and that fasting hyperglycemia 
and insulin resistance in high-fat-fed rodents result from an inap- 
propriate increase in hepatic acetyl CoA concentrations due to 
macrophage-induced WAT lipolysis. 

RESULTS 

Insulin Suppression of Glucose Production Is 
Temporally Associated with Suppression of Lipolysis 
and Hepatic Acetyl CoA 

In order to examine the respective roles of hepatic acetyl CoA 
and glycerol turnover in the regulation of HGP by insulin, we 
infused awake rats with insulin to mimic physiologic postprandial 
insulin concentrations during a hyperinsulinemic-euglycemic 
clamp and found that plasma non-esterified fatty acid (NEFA) 
and glycerol concentrations were suppressed by 90% within 
5 min, and HGP was suppressed by ~70% within 10 min after 
initiation of the insulin infusion as reflected by a rapid increase 
in the plasma glucose specific activity (Figures 1A-1C; Figures 
S1A-S1D). This reduction in HGP was associated with a 50% 
reduction in hepatic acetyl CoA concentrations within 10 min 
of the start of the insulin infusion as well as a 60% suppression 
of whole-body glycerol and palmitate turnover at steady state 
with no changes in plasma lactate, glucagon, or liver glycogen 
concentrations (Figures 1D-1F and S1E-S1G). Therefore, these 
data demonstrate that the major effect of insulin to suppress 
HGP under these conditions was through suppression of hepatic 
gluconeogenesis. 

Clamping Hepatic Acetyl CoA and Whole-Body Glycerol 
Turnover Abrogates Insulin Suppression of HGP 

Based on the previous data, we next examined whether altering 
hepatic acetyl CoA concentrations, and/or altering rates of 
whole-body glycerol turnover, would change rates of HGP dur- 
ing a hyperinsulinemic-euglycemic clamp. In order to modify he- 
patic acetyl CoA concentrations, we infused sodium acetate at 
increasing rates during a hyperinsulinemic-euglycemic clamp, 
and observed a strong correlation between hepatic acetyl CoA 
concentrations and both pyruvate carboxylase (PC) activity 
(r² = 0.85, p < 0.0001) and HGP (r² = 0.68, p < 0.0001) during 
the clamp, without any changes in liver glycogen content (Figures 
1G-1I and S1H-S1M). 
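
The r² values reported here are coefficients of determination for a linear relationship between two measured quantities. As a minimal sketch in Python of how such a value is obtained, the function below computes the squared Pearson correlation; the function name and the example numbers are hypothetical and do not reproduce any measurement from the study.

```python
def r_squared(x, y):
    """Squared Pearson correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need at least two paired observations")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("r^2 is undefined when one variable is constant")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical paired values (e.g., a metabolite level versus a flux), for illustration only.
x = [1.0, 2.0, 3.0]
y = [1.0, 2.0, 4.0]
print(round(r_squared(x, y), 4))
```

An r² of 0.85 means that about 85% of the variance in one variable is accounted for by a linear fit on the other; it does not by itself establish the direction of causation, which is why the infusion experiments matter.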

Likewise, in order to assess the potential contribution of insu- 
lin-mediated suppression of glycerol turnover to insulin-medi- 
ated suppression of HGP by a substrate-push mechanism, we 
infused glycerol at increasing rates in awake rats during a hyperinsulinemic-euglycemic 
clamp and observed a strong correlation 
(r² = 0.55, p < 0.0001) between rates of glycerol turnover 
and rates of HGP, without any change in liver glycogen or acetyl 
CoA (Figures 1J-1L and S1N-S1R). Taken together, these data 
demonstrate that modulation of either hepatic acetyl CoA con- 
centrations or glycerol turnover can independently modulate 
rates of HGP during a hyperinsulinemic-euglycemic clamp. 

In order to determine whether insulin-mediated suppression of 
hepatic acetyl CoA concentrations and glycerol turnover were 
responsible for insulin-mediated suppression of HGP, we co- 
infused both acetate and glycerol to match hepatic acetyl CoA 
concentrations and rates of glycerol turnover during a hyperinsu- 
linemic-euglycemic clamp to those observed under basal condi- 
tions (Figures 2A-2D; Figures S2A and S2B). Matching hepatic 
acetyl CoA concentrations and whole-body glycerol turnover 
rates to baseline levels completely abrogated insulin-mediated 
suppression of PC flux and HGP, without any changes to plasma 
glucagon or liver glycogen concentrations, or expression of glu- 
coneogenic proteins in the liver (Figures 2E, 2F, and S2C-S2E). 
Multiple regression analysis confirmed the linear relationships 
between HGP and both liver acetyl CoA and whole-body glycerol 
turnover (p = 0.002 and p = 0.01, respectively). We next assessed 
the relative contributions of oxaloacetate and glycerol conver- 
sion to glucose before and during a clamp and found that the 
suppression of hepatic gluconeogenesis could entirely be 
ascribed to suppression of lipolysis, with 85% of the suppression 
of HGP attributed to reduced conversion of oxaloacetate to 
glucose and the remainder attributed to reduced conversion of 
glycerol to glucose (Figure 2E). 

Liver Akt1/Akt2/Foxo1 Knockout Mice Suppress 
Lipolysis, Hepatic Acetyl CoA, and HGP Normally, but 
Acetate and Glycerol Infusion Abrogates Insulin 
Suppression of HGP 

TKO mice exhibited no alterations in basal or insulin-mediated 
reductions in rates of HGP before and during a hyperinsuline- 
mic-euglycemic clamp despite clear knockdown of Akt1/Akt2 
and Foxo1 protein concentrations in liver (Lu et al., 2012) (Figure 3A; 
Figures S3A-S3C). Consistent with our data in awake 
rats, hepatic acetyl CoA concentrations, PC activity, and rates 
of whole-body lipolysis were similarly suppressed in WT and 
TKO mice during the hyperinsulinemic-euglycemic clamp, 
without any change in hepatic glycogen content (Figures 3B-3D 
and S3D-S3G). Furthermore, co-infusion of acetate and glycerol 
to maintain basal hepatic acetyl CoA concentrations and 
basal rates of glycerol turnover fully abrogated insulin’s ability 
to suppress HGP during the hyperinsulinemic-euglycemic clamp 
(Figures 3B-3D). 

Inhibition of Lipolysis Decreases Hepatic Acetyl CoA 
and HGP in Insulin Receptor Antisense Oligonucleotide- 
Treated Rats Lacking Insulin Receptor in Liver and Fat 

In order to more directly examine the putative requirement for 
suppression of lipolysis to permit suppression of HGP, we 
treated rats with a 2'-O-methoxyethyl chimeric antisense oligonucleotide 
to knock down the insulin receptor (IR ASO) in liver 
and adipose tissue (Figure 3E). IR ASO-treated rats displayed 
severe insulin resistance manifested by failure to suppress lipol- 
ysis, hepatic acetyl CoA, and HGP during a hyperinsulinemic- 
euglycemic clamp; however, all of these effects were abrogated 
Figure 1. Rapid Suppression of Hepatic Glucose Production Rates Is Temporally Associated with a Parallel Rapid Suppression of Lipolysis 

(A) Hepatic glucose production measured by the Steele equation (Steele, 1959). Comparisons by ANOVA with Bonferroni’s multiple comparisons test. ****p < 
0.0001 versus time 0. 

(B and C) Plasma non-esterified fatty acid (NEFA) and glycerol concentrations during the hyperinsulinemic-euglycemic clamp. In (A)-(C), n = 6 per time point. 
(D) Hepatic acetyl CoA concentrations, n = 4 per time point. *p < 0.01, **p < 0.01 versus time 0 by the two-tailed unpaired Student’s t test. 

(E and F) Whole-body palmitate and glycerol turnover rates. 

(G-I) Liver acetyl CoA concentrations, hepatic glucose production, and pyruvate carboxylase activity in rats undergoing a hyperinsulinemic-euglycemic clamp 
with co-infusion of acetate. In (G)-(L), n = 4 per group. 

(J-L) Plasma glycerol concentrations, whole-body glycerol turnover, and hepatic glucose production in rats undergoing a hyperinsulinemic-euglycemic clamp 
with co-infusion of glycerol. 

In (G)-(L), n = 4 per group. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001 versus 0 μmol/(kg·min). In all panels, data are mean ± SEM. See also Figure S1. 



by administration of atglistatin, a specific inhibitor of adipose tri- 
glyceride lipase (Figures 3F-3H and S3H-S3L), attributing the 
suppression of HGP to the inhibition of lipolysis and subsequent 
reductions in hepatic acetyl CoA and PC activity in both control 
and IR knockdown rats. 



High-Fat-Fed Rats Manifest Increased Rates of HGP and 
Hepatic Insulin Resistance Associated with Increased 
Lipolysis and Increased Hepatic Acetyl CoA 

In order to understand the potential role of hepatic acetyl 
CoA in the pathogenesis of fasting and postprandial 
Figure 2. Normalizing Hepatic Acetyl CoA Concentrations and Whole-Body Glycerol Turnover to Baseline Levels in the Hyperinsulinemic- 
Euglycemic Clamp Abrogates Insulin’s Ability to Suppress Hepatic Glucose Production 

(A) Liver acetyl CoA content. 

(B) Whole-body glycerol turnover. 

(C) Plasma glycerol concentrations. 

(D) Glucose infusion rate. 

(E) Hepatic glucose production from oxaloacetate (lower bars) and glycerol (upper bars). 

(F) PC activity. 

In all panels, ****p < 0.0001 versus basal, §§§§p < 0.0001 versus hyperinsulinemic-euglycemic clamp, n = 6 per group. Data are mean ± SEM. Comparisons by 
ANOVA with Bonferroni’s multiple comparisons test. See also Figure S2. 



hyperglycemia in a chronic high-fat-fed rodent model of 
insulin resistance associated with inflammation, we fed male 
Sprague-Dawley rats a high-fat diet for 4 weeks and observed, 
as expected, increases in fasting plasma glucose, insulin, 
and non-esterified fatty acid (NEFA) concentrations without 
any increases in plasma glucagon concentrations (Figures 
S4A-S4D). Fasting hyperglycemia in the HFD group was 
associated with a 25% increase in rates of basal HGP, 60% 
of which was derived from increases in hepatic pyruvate 
carboxylase flux, and 40% from increased conversion of 
glycerol to glucose. In chow-fed rats, insulin suppressed 



HGP in a hyperinsulinemic-euglycemic clamp, 85% through 
reductions in flux through hepatic pyruvate carboxylase, 
and 15% through reductions in the contribution of glycerol 
to glucose (Figure 4A; Figures S4E-S4G). However, 
in the high-fat-fed rats, insulin failed to suppress HGP 
during a hyperinsulinemic-euglycemic clamp; this failure was associated with 
increased basal and clamped acetyl CoA and PC activity, 
and with higher rates of lipolysis both before and during the 
hyperinsulinemic-euglycemic clamp in the HFD group, without 
any differences in gluconeogenic protein expression (Figures 
4B-4E, S4H, and S4I). 
Figure 3. Suppression of Lipolysis Results in Decreased Hepatic Acetyl CoA Concentrations and Suppression of Hepatic Glucose 
Production Independent of Hepatic Insulin Signaling 

(A) Protein expression in Akt1 and 2 and Foxo1 knockout mice. 

(B) Hepatic glucose production. In all panels, black bars, basal; white bars, clamp; and gray bars, clamp+acetate+glycerol. 

(C) Hepatic acetyl CoA concentrations. 

(D) Hepatic pyruvate carboxylase activity. In (B)-(D), *p < 0.05, **p < 0.01, ***p < 0.001 versus basal, §p < 0.05, §§p < 0.01, §§§p < 0.001 versus hyperinsulinemic- 
euglycemic clamp by ANOVA. Data are mean ± SEM of n = 5 per group (basal), 8-9 per group (hyperinsulinemic-euglycemic clamp), and 3-4 per group 
(hyperinsulinemic-euglycemic clamp + acetate + glycerol). 

(E) Protein expression in insulin receptor knockdown rats. 

(F) Hepatic glucose production. Atglistatin-treated rats were given 600 μmol/kg. 

(G) Liver acetyl CoA. 

(H) Hepatic PC activity. 

In (F)-(H), ****p < 0.0001 versus basal, §§§§p < 0.0001 versus hyperinsulinemic-euglycemic clamp by ANOVA. Data are mean ± SEM of n = 6 per group. See also 

Figure S3. 



Increased Lipolysis Plays a Key Role in Excess Hepatic 
Gluconeogenesis in High-Fat-Fed Rodents 

In order to confirm whether excess lipolysis causes excess HGP 
in diet-induced obesity, we studied high-fat-fed adipose- 
specific adipose triglyceride lipase (ATGL) knockout mice. These 
mice exhibited a 5-fold increase in insulin’s ability to suppress 
HGP in a hyperinsulinemic-euglycemic clamp, associated with 
40%-60% reductions in hepatic acetyl CoA, PC activity, and 
whole-body palmitate and glycerol turnover during the clamp 
(Figures 4F-4J; Figures S5J-S5M). 



High-Fat-Fed Rats Manifest Insulin Resistance in White 
Adipose Tissue 

To determine the primary source(s) of the excess lipolysis in high-fat-fed 
rodents, we measured the ratio of tissue to plasma ¹³C₁₆ 
palmitate in rats infused with ¹³C₁₆ palmitate. The ratio of tissue 
to plasma ¹³C₁₆ palmitate was ~100% in liver, skeletal muscle, 
and kidney, but only ~65% in WAT, confirming WAT as the 
source of the majority of fatty acids derived from lipolysis in 
both the basal and clamped state (Figures S4N and S4O). In addition, 
in order to determine the proportion of acetyl CoA derived 
Figure 4. Four Weeks of High-Fat Feeding Increases Hepatic Glucose Production as a Result of Increased Lipolysis 

(A) Hepatic glucose production from oxaloacetate and glycerol. 

(B and C) Liver acetyl CoA and PC activity. 

(D and E) Whole-body palmitate and glycerol turnover. 

In (A)-(E), data are mean ± SEM of n = 6 per group. *p < 0.05, **p < 0.01, ****p < 0.0001 versus chow-fed, basal rats by the two-tailed unpaired Student’s t test. 
(F) Hepatic glucose production in basal and clamped high-fat-fed adipose-specific ATGL knockout mice. 

(G and H) Liver acetyl CoA and PC activity. 

(I and J) Whole-body palmitate and glycerol turnover. 

In (F)-(J), data are mean ± SEM of n = 8 per group. ***p < 0.001, ****p < 0.0001 versus basal WT, ####p < 0.0001 versus basal ATGL knockouts by the two-tailed 
unpaired Student’s t test. See also Figures S4 and S5. 
from palmitate, we measured the ratio of hepatic ¹³C₂ acetyl CoA 
to ¹³C₁₆ palmitate in rats infused with ¹³C₁₆ palmitate. In both 
chow and high-fat-fed rats, under both basal and clamped con- 
ditions, the enrichment of hepatic acetyl CoA was approximately 
75% of palmitate, showing that a large proportion of hepatic 
acetyl CoA is derived from palmitate in vivo (Figure S4P). 
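
The two enrichment ratios used in this section can be illustrated with a minimal sketch. The function name and the enrichment values are hypothetical, chosen only to reproduce the approximate ratios reported above (~65% for WAT palmitate and ~75% for hepatic acetyl CoA); the units are assumed to be the same for product and precursor.

```python
def enrichment_ratio(product_enrichment, precursor_enrichment):
    """Return product enrichment as a fraction of precursor enrichment.

    A ratio near 1.0 suggests the pool is fed almost entirely by the labeled
    precursor; a lower ratio suggests dilution by an unlabeled source.
    """
    if precursor_enrichment <= 0:
        raise ValueError("precursor enrichment must be positive")
    return product_enrichment / precursor_enrichment


# Hypothetical enrichments (illustrative only, not measured values).
plasma_palmitate = 4.0
wat_palmitate = 2.6
hepatic_acetyl_coa = 3.0

wat_ratio = enrichment_ratio(wat_palmitate, plasma_palmitate)
acetyl_coa_ratio = enrichment_ratio(hepatic_acetyl_coa, plasma_palmitate)
```

Under these assumed values the WAT ratio is 0.65, consistent with dilution of the tracer by unlabeled palmitate released locally, and the acetyl CoA ratio is 0.75.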

WAT insulin resistance in high-fat-fed rats was reflected in the 
failure of insulin to stimulate Akt phosphorylation, glucose uptake, 
or lipogenesis in WAT (Figures S4Q-S4S). The increased lipolysis 
was associated with a larger mean diameter of large adipocytes, a 
reduced number of large adipocytes (Kursawe et al., 2010), and a 
doubling in WAT comparative gene identification-58 (CGI-58) 
protein expression (Yamaguchi et al., 2004, 2007), without any 
other changes in lipolytic or insulin signaling proteins in WAT, 
liver glycogen or ceramide content, plasma lactate, FGF-21, 
adiponectin, or lipolytic hormones (Figures S4T-S4EE). Liver 
concentrations of other putative mediators of gluconeogenesis, 
including the ratios of ATP to ADP, ATP to AMP, NAD+ to 
NADH, and four TCA cycle intermediates were also unchanged 
in high-fat-fed or clamped rats (Figures S5A-S5G). 

Inhibition of Adipose Triglyceride Lipase Normalizes 
Hepatic Acetyl CoA Content, PC Activity, HGP, and 
Plasma Glucose Concentrations in High-Fat-Fed Rats 

To determine whether increased rates of WAT lipolysis leading to 
increased hepatic acetyl CoA concentrations and PC activity in 
HFD rats were responsible for the increased rates of HGP in 
the basal state and the failure of insulin to suppress HGP during 
the hyperinsulinemic-euglycemic clamp, we treated rats with at- 
glistatin. Atglistatin treatment normalized fasting plasma glucose 
and insulin concentrations and HGP by lowering WAT lipolysis, 
liver acetyl CoA concentrations, and PC activity without affecting 
plasma glucagon concentrations (Figures S5H-S5Q). These 
data demonstrate a critical but rapidly reversible role for 
WAT lipolysis driving increases in hepatic acetyl CoA and PC ac- 
tivity resulting in hyperglycemia in high-fat-fed, insulin-resistant 
rats. 

To further examine the role of increases in hepatic acetyl CoA 
in causing excess gluconeogenesis, we treated chow and high- 
fat-fed rats with etomoxir, a potent inhibitor of carnitine palmi- 
toyltransferase-1 (CPT-1). Consistent with the previous studies 
demonstrating a strong relationship between hepatic acetyl 
CoA concentration and rates of HGP, etomoxir treatment re- 
sulted in reductions in hepatic acetyl CoA content, which was 
associated with lower fasting plasma glucose and insulin con- 
centrations, reduced rates of HGP, and reduced PC activity 
without any change in liver glycogen content (Figures S5R- 
S5W). These data further support the hypothesis that hepatic 
acetyl CoA is a key regulator of hepatic gluconeogenesis and 
HGP in normal, fasting rodents. 

Adipose Tissue Inflammation Increases Hepatic Acetyl 
CoA Content and PC Activity and Causes Hepatic Insulin 
Resistance in HFD Rats 

Next, we sought to determine the reason for the increased rates 
of lipolysis in HFD rats. In the absence of any changes in lipolytic 
hormone (growth hormone, epinephrine, norepinephrine, corti- 
costerone) concentrations, we hypothesized that adipose tissue 



inflammation may play an important role in this process (Manteiga 
et al., 2013; Olefsky and Glass, 2010; Robbins et al., 
2014). Consistent with this hypothesis, we observed a doubling 
of plasma and WAT macrophage interleukin-6 (IL-6) concentra- 
tions, without changes in any other inflammatory mediators (Fig- 
ures 5A, 5B, and S6A). We confirmed white adipose tissue as a 
target of IL-6 by incubating WAT with the concentrations of 
IL-6 measured in plasma in vivo (15 and 25 nM) and observed 
a 50% increase in fatty acid release into the media over 24 hr 
(Figure S6B). Increases in granulocyte macrophage colony stim- 
ulating factor (GM-CSF) in both plasma and WAT may be 
responsible for attracting increased macrophages to WAT in 
high-fat-fed rats and/or inducing their proliferation, and IL-6 
has been shown to stimulate lipolysis through activation of the 
JAK-Stat signaling pathway (Cernkovich et al., 2008; Guschin 
et al., 1995; Lutticken et al., 1994; Stahl et al., 1994). A modest 
20% increase in WAT tumor necrosis factor-α (TNF-α) was also 
observed, consistent with increased macrophage activation in 
HFD rats (Figures S6C-S6D). Time-course studies with varying 
periods of HFD treatment confirmed a temporal relationship be- 
tween fasting plasma glucose, insulin, and NEFA concentrations 
and both plasma and WAT macrophage IL-6 and GM-CSF con- 
centrations (Figures S6E-S6K), with each increasing between 
days 6 and 8. In addition, by 6 days of high-fat feeding, adipo- 
cytes had begun to exhibit hypertrophy (Figure S6L). 

In order to directly assess the role of excess IL-6 in promoting 
increased lipolysis, we treated HFD rats with a monoclonal neutralizing 
antibody to IL-6 (Figure S6M). Neutralizing IL-6 in HFD rats 
reduced plasma glucose, NEFA, glycerol, and insulin concentra- 
tions to normal levels without any change in plasma glucagon 
concentrations (Figures S6N-S6S). By lowering rates of lipolysis 
and reducing hepatic acetyl CoA concentrations, anti-IL-6 treat- 
ment resulted in a 50% reduction in hepatic PC activity and 
normalized rates of HGP in the HFD rat, without affecting macrophage 
TNF-α concentrations (Figures 5C-5F and S6T-S6V). 

IL-6 Infusion Induces Hepatic Insulin Resistance 
through Increased Adipose Tissue Lipolysis and 
Increased Hepatic Acetyl CoA Content, PC Activity, 
and HGP 

In order to directly examine the role of IL-6 in mediating increased 
HGP through increased lipolysis, hepatic acetyl CoA content, 
and PC activity, we performed a 4 hr intra-arterial infusion of re- 
combinant IL-6 in control rats (Figure S6W). We found that an 
infusion of IL-6 caused an increase in plasma glucose, NEFA, 
and insulin concentrations, which were associated with a marked 
increase in rates of whole-body lipolysis. These increases in rates 
of lipolysis were associated with increases in HGP, which could 
be attributed to a doubling in hepatic acetyl CoA content and 
an 80% increase in hepatic PC activity without any change in 
plasma glucagon concentrations (Figures 5G-5J and S6X- 
S6EE). However, these effects were fully abrogated by ATGL 
inhibition. The induction of lipolysis in IL-6-infused rats was asso- 
ciated with an increase in expression of CGI-58, a potent co-acti- 
vator of ATGL, in WAT. These data suggest that IL-6 stimulation 
of lipolysis may be attributed in part to IL-6-induced increases in 
WAT CGI-58 concentrations (Figure S6FF), highlighting a poten- 
tial novel role for CGI-58 in inflammation-induced increases in 
WAT lipolysis in insulin-resistant rats. Taken together, these data 
are consistent with the hypothesis that local production of IL-6 
from macrophages in WAT promotes increased lipolysis leading 
to increased hepatic acetyl CoA content, PC activity, HGP, and 
hyperglycemia in high-fat-fed rodents. 

Macrophage-Specific JNK Knockout Mice Are 
Protected from Diet-Induced Insulin Resistance 

In order to examine whether local production of IL-6, from mac- 
rophages in WAT, promotes increased lipolysis, hepatic acetyl 
CoA content, PC activity, HGP, and hyperglycemia in high-fat- 
fed rodents, we next examined whether this mechanism would 
explain why mcpJNK KO mice, which have strikingly low plasma 



Figure 5. Increases in Plasma Interleukin-6 
Concentrations Cause Hyperglycemia in 
High-Fat-Fed Rats 

(A and B) IL-6 concentrations in plasma and white 
adipose tissue macrophages. 

(C-F) Hepatic glucose production, PC activity, 
liver acetyl CoA, and whole-body palmitate turn- 
over in high-fat-fed rats treated with an IL-6- 
neutralizing antibody. 

(G and H) Hepatic glucose production, liver PC 
activity, hepatic acetyl CoA, and whole-body 
palmitate turnover in control rats treated with or 
without IL-6 or IL-6 and atglistatin (200 μmol/kg). 
In all panels, data are mean ± SEM of n = 6 per 
group. Comparisons by the two-tailed unpaired 
Student’s t test (A-E) or by ANOVA with Bonfer- 
roni’s multiple comparisons test (F-H). *p < 0.05, 
**p < 0.01, ***p < 0.001 versus controls; §§p < 0.01, 
§§§p < 0.001 versus chow-fed, IL-6-infused rats. 
See also Figure S6. 



and WAT macrophage IL-6 concentra- 
tions (Figures 6A and 6B), are protected 
from high-fat diet-induced hepatic insulin 
resistance. We found that macrophage- 
specific mcpJNK KO mice exhibited 
improved suppression of HGP during 
the hyperinsulinemic-euglycemic clamp, 
which was associated with reductions 
in clamped glycerol and palmitate turn- 
over, hepatic acetyl CoA content, and 
hepatic PC activity (Figures 6C-6H and 
S7A-S7C). All of these effects were 
abrogated by infusion of IL-6 to 
match plasma IL-6 concentrations in 
the mcpJNK knockouts to the wild-type 
group, demonstrating a critical role for 
IL-6-induced WAT lipolysis in mediating 
hepatic insulin resistance in high-fat- 
fed mice. 



Obese Insulin-Resistant 
Adolescents Display Increased 
Lipolysis and Increased WAT IL-6 

In order to determine whether our find- 
ings in rats would translate to humans, 
we measured rates of HGP and lipolysis in obese insulin- 
resistant adolescents. Compared to weight-matched obese 
insulin-sensitive control subjects, insulin-resistant subjects dis- 
played fasting hyperglycemia and hyperinsulinemia, which 
were associated with increased rates of HGP, impaired 
insulin-mediated suppression of lipolysis and impaired insulin- 
mediated suppression of HGP. Impaired insulin suppression 
of lipolysis was associated with an increased diameter of large 
adipocytes and increased macrophage infiltration into the 
subcutaneous white adipose tissue (Figures 7A-7I; Table S1). 
Consistent with our findings in insulin-resistant high-fat-fed 
rodents, insulin-resistant obese adolescents displayed in- 
creased plasma IL-6 concentrations, and an approximately 
Figure 6. Mice Lacking JNK in Macrophages Are Protected from Diet-Induced Hepatic Insulin Resistance 

(A and B) Plasma and WAT macrophage IL-6 concentrations. 

(C) Glucose infusion rate to maintain euglycemia during a hyperinsulinemic-euglycemic clamp. 

(D) Liver acetyl CoA concentrations. 

(E) Hepatic glucose production. 

(F) Liver PC activity. 

(G and H) Whole-body palmitate and glycerol turnover. 

**p < 0.01, ***p < 0.001, ****p < 0.0001 versus basal mice; §p < 0.05, §§§p < 0.001, §§§§p < 0.0001 versus wild-type, clamped mice. Data are mean ± SEM of n = 10 
(WT), 8 (mcpJNK KO), or 10 (mcpJNK KO+IL-6) per group. The groups were compared by ANOVA with Bonferroni’s multiple comparisons test. See also Figure S7. 

50% increase in IL-6 concentrations in WAT, which were more 
than 20 times higher than plasma IL-6 concentrations (Figures 
7J and 7K). Similar to the HFD rats, the increased rates of lipolysis 
and HGP in obese insulin-resistant adolescents were 
associated with increased CGI-58 protein expression in WAT 
(Figure 7L). 
Figure 7. Insulin-Resistant Obese Adolescents Have Increased Lipolysis and Impaired Suppression of HGP Associated with Increased WAT 
IL-6 Concentrations 

(A and B) Fasting and 2-hr post-challenge plasma glucose. 

(C) Fasting plasma insulin. 

(D) HOMA. 

(E) Hepatic glucose production. Closed bars, basal; open bars, hyperinsulinemic-euglycemic clamp. 

(F) Fasting NEFA. 

(G) Suppression of glycerol turnover in the clamp. 

(H) Mean diameter of large adipocytes. 

(I) WAT macrophage counts. 

(J and K) Plasma and WAT IL-6 concentrations. 

(L) WAT CGI-58 protein. 

Data are the mean ± SEM of 9-21 controls and 15-39 insulin-resistant subjects. Comparisons were made by the two-tailed unpaired Student’s t test. See also 

Table S1. 



DISCUSSION 

Suppression of HGP by insulin is critical for maintenance of 
normal glucose homeostasis and alterations in this process 
play a major role in the pathogenesis of T2D; however, the 
molecular mechanism by which insulin suppresses HGP has remained 
elusive and has been attributed to direct actions of insulin 
on liver (Edgerton et al., 2009; Hepp, 1977; Pilkis and Granner, 
1992) and indirect effects of insulin on brain (Hill et al., 2010; Jordan 
et al., 2010; Lin et al., 2010; Obici et al., 2002; Schwartz et al., 
2013) and adipocyte (Lewis et al., 1996; Rebrin et al., 1996). 
The canonical signaling pathway by which insulin has been 
postulated to directly inhibit hepatic gluconeogenesis is through 
activation of Akt, resulting in suppression of gluconeogenic protein 
expression by phosphorylation and exclusion of Forkhead 
box (Foxo1) from the nucleus of the hepatocyte, and consequent 
transcription-mediated reduction in hepatic gluconeogenesis 
(Haeusler et al., 2014). However, the finding that triple knockout 
mice lacking Akt1 and Akt2 and Foxo1 in liver have normal suppression 
of HGP during a hyperinsulinemic-euglycemic clamp 
(Lu et al., 2012) poses considerable challenges to this paradigm. 
Furthermore, insulin suppresses HGP within minutes of administration 
in vivo, which is unlikely to be explained by transcriptionally 
mediated processes. 

To explain this apparent paradox, we hypothesized that insu- 
lin’s ability to suppress lipolysis in WAT may be critical for the 
suppression of HGP by reducing fatty acid flux to the liver result- 
ing in decreased hepatic acetyl CoA concentrations and 
decreased PC activity resulting in decreased conversion of pyru- 
vate to glucose. While hepatic acetyl CoA has been long known 
to modulate pyruvate carboxylase activity in vitro (Barritt et al., 
1966; Cazzulo and Stoppani, 1968; Cooper and Benedict, 
1966; Keech and Utter, 1963; Krebs et al., 1965), methodological 
limitations stemming from acetyl CoA’s low hepatocellular con- 
centrations and rapid degradation ex vivo have prevented its 
measurement in vivo. Additionally, Chisholm et al. were not 
able to demonstrate any direct effect of insulin on pyruvate 
carboxylase activity in isolated hepatocytes (Chisholm et al., 
1983), raising the question of whether a hepatocyte-autonomous 
process is primarily responsible for insulin-mediated suppression 
of PC activity and hepatic glucose production. 

In order to examine this hypothesis, we developed a compre- 
hensive metabolomics flux approach to simultaneously mea- 
sure whole-body rates of lipolysis, glucose turnover, and intra- 
hepatic fluxes as well as hepatic acetyl CoA content in rapidly 
freeze-clamped livers. Using this approach, we demonstrate 
that hepatic acetyl CoA content is a key regulator of 
HGP and that an insulin-mediated reduction of hepatic acetyl 
CoA concentrations, through inhibition of WAT lipolysis, is the 
major mechanism by which insulin suppresses HGP in vivo. 
While the ability of insulin to suppress HGP through an 
indirect mechanism was postulated by Levine and Fritz more 
than 50 years ago, the molecular mechanism regarding this 
effect has remained elusive. These are the first studies, to our 
knowledge, that demonstrate insulin regulation of pyruvate 
carboxylase activity through modulation of hepatic acetyl CoA 
concentrations and demonstrate that this mechanism is 
responsible for insulin suppression of hepatic glucose produc- 
tion in vivo. Furthermore, these data identify the mechanism 
by which mice lacking the canonical hepatic insulin signaling 
factors (Akt1, Akt2, Foxo1) are able to suppress HGP normally 
and explain the enigmatic inability of insulin to suppress 
hepatic glucose production in isolated hepatocyte preparations 
(Lu et al., 2012). 

We next examined whether increased hepatic acetyl CoA con- 
centrations might be responsible for increased rates of hepatic 
glucose production in insulin resistance and might link inflamma- 
tion and macrophage-induced lipolysis in WAT with fasting and 
postprandial hyperglycemia through a similar mechanism. While 
lipid-induced insulin resistance occurs early in the pathogenesis 



of T2D and can be dissociated from inflammation at this 
stage, a key question arises as to identification of the factor(s) 
that promotes the progression from ectopic lipid-induced in- 
sulin resistance to impaired glucose tolerance and fasting hyper- 
glycemia (Shulman, 2014). The classic view of this process 
implicates impaired pancreatic β-cell and α-cell function along 
with inflammation in this transition where the former leads to 
increased hepatic gluconeogenic gene transcription and the 
latter inhibits insulin action through the release of cytokines 
and adipocytokines, which, in turn, leads to inhibition of insulin 
signaling and increased hepatic gluconeogenic protein expression 
through activation of the NF-κB/JNK/ceramide biosynthetic 
pathways (Olefsky and Glass, 2010). In this study, we explored 
an alternative hypothesis in which progression to fasting hyper- 
glycemia occurs through macrophage-induced WAT lipolysis re- 
sulting in increased hepatic pyruvate carboxylase activity 
through increases in hepatic acetyl CoA content and increased 
conversion of glycerol to glucose. Consistent with this hypothe- 
sis, we show that high-fat-fed rats manifested a 2-fold increase 
in plasma and WAT IL-6 concentrations, which caused hypergly- 
cemia by increasing lipolysis, hepatic acetyl CoA, and HGP both 
before and during a hyperinsulinemic-euglycemic clamp. Infu- 
sions of IL-6 and treatment of high-fat-fed rats with an IL-6- 
neutralizing antibody demonstrate that IL-6 is both necessary 
and sufficient to induce hyperglycemia in the rat. These findings 
were validated genetically in adipose-specific ATGL knockout 
mice and mcpJNK knockout mice, which were protected from 
high-fat feeding-induced hepatic insulin resistance due to reduc- 
tions in WAT lipolysis leading to decreased hepatic acetyl CoA 
content, decreased PC activity, and decreased rates of HGP 
during a hyperinsulinemic-euglycemic clamp. 

Finally, we show these results translate to humans by 
demonstrating that WAT inflammation is associated with 
increased IL-6 concentrations in WAT, increased rates of lipol- 
ysis, increased rates of HGP, and hepatic insulin resistance 
assessed by a hyperinsulinemic-euglycemic clamp in obese in- 
sulin-resistant adolescents. These results are consistent with 
previous studies, which have observed that increased plasma 
IL-6 concentrations are strong predictors of insulin resistance 
in humans (Fernandez-Real et al., 2001; Glund et al., 2007; 
Kado et al., 1999; Mohamed-Ali et al., 1997; Petersen et al., 
2006; Pickup et al., 1997; Steptoe et al., 2014; Straub et al., 
2000; Yuen et al., 2009). 

Taken together, these data demonstrate that allosteric regu- 
lation of pyruvate carboxylase activity by hepatic acetyl CoA, 
through insulin suppression of WAT lipolysis, is a critical factor 
in the regulation of hepatic glucose production by insulin in vivo. 
Furthermore, these studies demonstrate that macrophage- 
induced WAT lipolysis leading to increased hepatic acetyl 
CoA content and increased pyruvate carboxylase activity/flux 
is a key molecular mechanism linking WAT inflammation to 
both fasting and postprandial hyperglycemia in T2D. These 
data explain insulin’s inability to suppress hepatic glucose pro- 
duction in high-fat-fed rodent models of diet-induced obesity 
and hepatic insulin resistance independent of transcriptional 
regulation of gluconeogenic enzymes or alterations in hepatic 
glycogen metabolism and also explain recent studies that 
find no relationship between hepatic gluconeogenic protein 
expression and fasting hyperglycemia in obese humans (Kumashiro 
et al., 2011). These data also challenge the canonical view 
of inflammation-mediated hepatic insulin resistance through 
alterations in circulating cytokines, which, in turn, lead to 
inhibition of hepatic insulin signaling and increased hepatic glu- 
coneogenic protein transcription through activation of the 
NF-κB/JNK/ceramide biosynthetic pathways. Finally, these results 
have important clinical implications for the pathogenesis 
of T2D by linking inflammation and macrophage-induced WAT 
lipolysis leading to increased hepatic acetyl CoA concentrations 
and increased PC flux resulting in fasting and postprandial 
hyperglycemia and identify macrophage-induced WAT lipolysis 
and hepatic acetyl CoA as a novel potential therapeutic target 
for T2D. 

EXPERIMENTAL PROCEDURES 
Animal Studies 

Rodent protocols were approved by the Institutional Animal Care and Use 
Committees at the institutions at which the animals were housed (Yale Univer- 
sity, the University of Pennsylvania, and the University of Massachusetts). 
Additional information about the animal models is provided in Extended Exper- 
imental Procedures. 

Basal and Hyperinsulinemic-Euglycemic Clamp Studies 

Basal infusion and hyperinsulinemic-euglycemic clamp studies were used to 
assess whole-body and hepatic insulin sensitivity as described in the 

Extended Experimental Procedures. 

Acute Suppression of Hepatic Gluconeogenesis 

To assess insulin’s acute ability to suppress gluconeogenesis, chow-fed rats 
were infused with [³H] glucose (0.1 nCi/min) for 120 min. Blood samples were 
taken at 100, 110, and 120 min to assess basal glucose turnover. At 120 min, 
a 20-min insulin infusion was initiated (20 mU × kg⁻¹ prime, 2 mU × kg⁻¹ × 
min⁻¹ for the duration of the study). Blood samples (100 μl) were taken every 
3 min during the insulin infusion, and specific activity was measured using a 
scintillation counter. 
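
The dosing and sampling schedule described above can be laid out as a short sketch. The function names, the example body weight, and the assumption that the first sample during the insulin infusion is drawn 3 min after it starts are all hypothetical; only the dose rates, infusion length, and sampling interval come from the text.

```python
def insulin_doses_mU(body_weight_kg, prime_mU_per_kg=20.0,
                     rate_mU_per_kg_min=2.0, duration_min=20):
    """Return (prime, continuous) insulin amounts in mU for one animal."""
    prime = prime_mU_per_kg * body_weight_kg
    continuous = rate_mU_per_kg_min * body_weight_kg * duration_min
    return prime, continuous


def sampling_times_min(basal_times=(100, 110, 120), insulin_start=120,
                       insulin_duration=20, interval=3):
    """Return (basal, during_insulin) blood sampling times in minutes."""
    end = insulin_start + insulin_duration
    # Assumption: sampling begins one interval after the infusion starts.
    during_insulin = list(range(insulin_start + interval, end + 1, interval))
    return list(basal_times), during_insulin


# Hypothetical 0.3 kg rat (illustrative only).
prime, continuous = insulin_doses_mU(0.3)
basal, during = sampling_times_min()
```

Under these assumptions, six samples fall within the 20-min insulin infusion, at 123 through 138 min.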

Variable Glycerol and Acetate Infusions 

Sodium acetate was infused intra-arterially in rats at infusion rates of 0, 130, 195, 
260, or 390 μmol × kg⁻¹ × min⁻¹. [3-³H] glucose was co-infused and blood 
and livers obtained as above. In a separate group of rats, [1,1,2,3,3-d₅] glycerol 
was infused at infusion rates of 0, 10, 30, 50, or 70 μmol × kg⁻¹ × min⁻¹ 
for 120 min. Glycerol turnover and concentrations were measured by gas 
chromatography/mass spectrometry (GC/MS) as described below. In the 
studies in which acetate and glycerol were co-infused during the clamp to 
abrogate insulin-mediated suppression of hepatic glucose production, 
the infusion rates selected were 195 μmol × kg⁻¹ × min⁻¹ acetate and 
10 μmol × kg⁻¹ × min⁻¹ glycerol. 

Measurement of Glucose, Fatty Acid, and Glycerol Turnover 

Whole-body glucose turnover was calculated as the ratio of the [3-³H] glucose 
infusion rate to the specific activity of plasma glucose at the end of the 120 min 
infusion. Endogenous glucose production represents the difference between 
the glucose infusion rate and the rate of glucose appearance. The liver was 
assumed to contribute 90% of whole-body glucose production based on 
our prior studies (Perry et al., 2014). Hepatic flux rates were calculated as 
we describe in Extended Experimental Procedures. [U-¹³C] palmitate (Beylot 
et al., 1993) and [1,1,2,3,3-d₅] glycerol (Previs et al., 1999) enrichment and total 
glycerol concentration were measured by GC/MS as previously reported, and 
turnover was calculated using the formula 



\[
\text{Turnover} = \frac{{}^{13}\text{C tracer enrichment}}{{}^{13}\text{C plasma enrichment}} \times \text{Infusion rate}.
\]
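
As a rough illustration of the tracer-dilution arithmetic described above, the following sketch computes turnover from enrichment values, whole-body glucose turnover from specific activity, and the hepatic share of glucose production. The function names, the example numbers, and the sign convention used for endogenous glucose production (rate of appearance minus exogenous infusion) are assumptions made for this example; the authors' full calculations are in their Extended Experimental Procedures.

```python
def tracer_turnover(tracer_enrichment, plasma_enrichment, infusion_rate):
    """Turnover = (tracer enrichment / plasma enrichment) * infusion rate."""
    if plasma_enrichment <= 0:
        raise ValueError("plasma enrichment must be positive")
    return tracer_enrichment / plasma_enrichment * infusion_rate


def glucose_turnover(tracer_infusion_rate, plasma_specific_activity):
    """Whole-body glucose turnover: tracer infusion rate / plasma specific activity."""
    if plasma_specific_activity <= 0:
        raise ValueError("specific activity must be positive")
    return tracer_infusion_rate / plasma_specific_activity


def endogenous_glucose_production(rate_of_appearance, glucose_infusion_rate):
    """Assumed sign convention: appearance minus exogenous glucose infusion."""
    return rate_of_appearance - glucose_infusion_rate


def hepatic_glucose_production(endogenous_production, hepatic_fraction=0.90):
    """The liver is assumed to contribute 90% of whole-body production."""
    return endogenous_production * hepatic_fraction


if __name__ == "__main__":
    # Hypothetical numbers, chosen only to show the arithmetic.
    print(tracer_turnover(99.0, 3.0, 0.5))
    ra = glucose_turnover(1000.0, 50.0)
    egp = endogenous_glucose_production(ra, 5.0)
    print(hepatic_glucose_production(egp))
```

Units are left to the caller: the enrichment ratio is dimensionless, so turnover carries the units of the infusion rate.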



ATGL Inhibition 

Groups of overnight fasted chow and high-fat-fed rats were treated with intra-
peritoneal injections of atglistatin (200 or 600 µmol/kg, as designated in the
text, XcessBiosciences), a small-molecule ATGL inhibitor, or saline vehicle.
They were then subjected to a basal infusion of [1,1,2,3,3-d₅] glycerol,
[U-¹³C] palmitate, and [3-³H] glucose as described above.

CPT-1 Inhibition 

Overnight fasted chow and high-fat-fed rats were treated with intraperitoneal 
injections of etomoxir (8 mg × kg⁻¹) and immediately began an infusion of
[1,1,2,3,3-d₅] glycerol, [U-¹³C] palmitate, and [3-³H] glucose as described
above. 

Adipocyte Morphology and Function 

White adipose tissue glucose uptake was measured in basal and clamped 
rats following injection of 2-[1-¹⁴C] deoxyglucose (Cartee and Bohn, 1995).
Insulin-stimulated WAT lipogenesis and adipocyte cell size were measured 
in chow and 4-week high-fat-fed rats as previously described (Kursawe 
et al., 2010). To measure IL-6-induced lipolysis, ~100-mg WAT samples 
were incubated in 5 mM glucose media containing 15 or 25 nM IL-6 to mimic 
plasma IL-6 concentrations in vivo in chow and high-fat-fed rats. After 6 hr of 
incubation at 37°C, NEFA concentrations in the media were measured and 
normalized to the total protein content of the WAT sample, measured by Brad- 
ford assay. 

IL-6 Neutralization 

Overnight fasted 4-week high-fat-fed rats were treated with an intraperitoneal 
injection of a neutralizing antibody to IL-6 (10 µg/rat, R&D Systems) in saline, or
an equivalent volume of saline. Immediately following the antibody injection,
[1,1,2,3,3-d₅] glycerol, [U-¹³C] palmitate, and [3-³H] glucose were infused
intra-arterially for the measurement of lipolysis and hepatic glucose production 
as described above. 

IL-6 Infusion 

A continuous intra-arterial infusion of IL-6 (10 µg/hr, Sigma) was performed
for 4 hr. After 2 hr of IL-6 infusion, atglistatin (200 µmol/kg) was injected intra-
venously, and rates of lipolysis and hepatic glucose production were assessed
as described above. 

Biochemical Analyses 

Plasma glucose concentrations were measured using the YSI 2700 Select 
Biochemistry Analyzer, and plasma lactate concentrations by COBAS. Plasma 
insulin concentrations were measured by ELISA (Millipore). NEFA concentra- 
tions were measured using a Wako kit. Plasma insulin was measured by 
radioimmunoassay by the Yale Diabetes Research Core. Plasma FGF-21, adi-
ponectin, corticosterone, epinephrine, norepinephrine, growth hormone, and
inflammatory cytokine concentrations were measured by ELISA (Millipore,
Invitrogen, Abcam, IBL International, IBL International, Life Technologies,
and QIAGEN, respectively). 

Tissue Analyses 

Liver acetyl CoA concentration and enrichment was measured by liquid 
chromatography/mass spectrometry/mass spectrometry (LC-MS/MS) as 
described in the Extended Experimental Procedures. Liver glycogen concen- 
trations were measured in basal and clamped rats using the method of Dalrym- 
ple and Hamm (1973). Pyruvate carboxylase activity was measured as 
previously described (Vatner et al., 2013), with acetyl CoA concentrations 
adjusted to match those measured by LC-MS/MS in each group. Liver ATP, 
ADP, AMP, NAD, and NADH concentrations were measured by LC-MS/MS 
as previously described (Perry et al., 2013). 

WAT Macrophage Isolation 

WAT tissue was washed with HBSS and then incubated at 37°C, shaking 
for 60 min in HBSS with 0.5 mg/ml collagenase (Sigma). After washing three 
times with HBSS, cell debris was reduced by Histopaque density centrifuga-
tion. The remaining cells were stained with CD11b-PE (eBioscience), and
CD11b+ WAT macrophages were enriched to >90% purity by EasySep PE
positive selection kit (STEMCELL Technologies). Purity and cell number
were confirmed by flow cytometry. WAT macrophage IL-6 and WAT tissue
GM-CSF and TNF-α were measured by ELISA (Life Technologies, Abcam,
and SABiosciences, respectively) and normalized to total protein content 
measured by Bradford assay. 

Western Blots 

Protein expression was measured in mouse and rat liver by western blot, with 
antibodies obtained from Cell Signaling Technology (Akt1, Akt2, Akt pSer473,
FoxO1, pyruvate carboxylase, phosphoenolpyruvate carboxykinase, glucose-
6-phosphatase, adipose triglyceride lipase, insulin receptor substrate-1, and
glyceraldehyde 3-phosphate dehydrogenase) or from Santa Cruz Biotech- 
nology (CGI-58, peroxisome proliferator-activated receptor gamma, glucose 
transporter type 4, and protein phosphatase 1). 

Human Studies 

The Yale Pathophysiology of Type 2 Diabetes in Obese Youth Study is a long- 
term project aimed at examining early alterations in glucose metabolism in 
relation to fat patterning in obese adolescents. Specific details of this study 
are described in the Extended Experimental Procedures. 

Statistical Analyses 

Statistical analyses were performed using GraphPad Prism. Comparisons be- 
tween two groups were performed by the two-tailed unpaired Student’s t test, 
while three groups were compared by ANOVA with Bonferroni’s post hoc test. 
Differences were considered significant at p < 0.05. Data are presented as 
mean ± SEM. 
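
The summary statistics named above can be sketched in a few lines. This is only an illustration using the Python standard library, not the GraphPad Prism procedure the authors ran: the function names and sample values are hypothetical, the t statistic would still need to be referred to a t distribution with the returned degrees of freedom to obtain a p value, and the Bonferroni step is shown as a simple multiplication of each p value by the number of comparisons.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for a sample."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(n)


def student_t(group_a, group_b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(group_a), len(group_b)
    if na < 2 or nb < 2:
        raise ValueError("each group needs at least two observations")
    df = na + nb - 2
    pooled_var = ((na - 1) * statistics.variance(group_a)
                  + (nb - 1) * statistics.variance(group_b)) / df
    std_err = math.sqrt(pooled_var * (1 / na + 1 / nb))
    t_stat = (statistics.mean(group_a) - statistics.mean(group_b)) / std_err
    return t_stat, df


def bonferroni(p_values):
    """Bonferroni-adjusted p values, capped at 1.0."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


if __name__ == "__main__":
    control = [1.0, 2.0, 3.0]      # hypothetical measurements
    treated = [4.0, 5.0, 6.0]
    print(mean_sem(control))
    print(student_t(control, treated))
    print([p < 0.05 for p in bonferroni([0.01, 0.04, 0.5])])
```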

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Extended Experimental Procedures, seven 
figures, and one table and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2015.01.012.
SUMMARY

Sensitization of the capsaicin receptor TRPV1 is cen- 
tral to the initiation of pathological forms of pain, and 
multiple signaling cascades are known to enhance 
TRPV1 activity under inflammatory conditions. How 
might detrimental escalation of TRPV1 activity be 
counteracted? Using a genetic-proteomic approach, 
we identify the GABAB1 receptor subunit as bona fide
inhibitor of TRPV1 sensitization in the context of
diverse inflammatory settings. We find that the
endogenous GABAB agonist, GABA, is released
from nociceptive nerve terminals, suggesting an
autocrine feedback mechanism limiting TRPV1
sensitization. The effect of GABAB on TRPV1 is inde-
pendent of canonical G protein signaling and rather
relies on close juxtaposition of the GABAB1 receptor
subunit and TRPV1. Activating the GABAB1 receptor
subunit does not attenuate normal functioning of 
the capsaicin receptor but exclusively reverts its 
sensitized state. Thus, harnessing this mechanism 
for anti-pain therapy may prevent adverse effects 
associated with currently available TRPV1 blockers. 

INTRODUCTION 

Pathological forms of pain are usually triggered by injury or 
inflammation of peripheral sensory neurons of the pain pathway. 
A diverse set of inflammatory stimuli can sensitize nociceptive 
neurons to promote pain hypersensitivity. As a consequence, in- 
hibition of individual inflammatory pathways as a means to atten- 
uate pain is a problematic approach for drug development, as 
parallel signaling cascades are still able to drive pathological, 
pain-promoting sensitization (Gold and Gebhart, 2010). 

One receptor that has been found to serve as downstream 
integrator of many inflammatory pathways and thus holds 
great hope for anti-pain therapy is the capsaicin receptor 
TRPV1 (Caterina et al., 1997; Tominaga et al., 1998). 

Next to its physiological function as a detector of noxious 
stimuli, a large body of literature attests to a crucial pathological 
role for TRPV1. Importantly, inflammatory sensitization leads to
dramatically reduced activation thresholds of TRPV1, producing
hyperalgesia and pain hypersensitivity. Indeed, Trpv1−/− ani-
mals completely lack thermal hyperalgesia, confirming TRPV1’s
central role as integrator of disparate inflammatory pathways
(Caterina et al., 2000; Davis et al., 2000).

TRPV1’s central role in pathological forms of pain has sparked
intense efforts to develop TRPV1 antagonists. However, block- 
ing TRPV1 activity per se leads to impaired noxious heat 
sensation and produces hypothermia, attesting to a role of this 
temperature-gated receptor in homeostatic thermoregulation. 
Therefore, preventing and counteracting sensitization of the re- 
ceptor while leaving basal TRPV1 activity untouched has been 
suggested to be a more promising avenue for pain therapy (Szal- 
lasi and Sheta, 2012; Vay et al., 2012; Woolf, 2010). 

Combining mouse genetics with proteomics, here we report 
the identification of the GABAB1 receptor as a modulator of
TRPV1 sensitization. 

Canonical GABAB signaling requires GABAB1 and GABAB2
receptor subunits to orchestrate activation of inhibitory G pro-
teins (Gassmann et al., 2004; Jones et al., 1998; Kaupmann
et al., 1998; Kuner et al., 1999; White et al., 1998). Surprisingly,
the GABAB2 subunit is dispensable for mediating inhibition of
TRPV1 sensitization, demonstrating that the underlying mecha-
nism diverges from canonical GABAB receptor signaling.

Collectively, our results establish an endogenous GABA/
GABAB1 feedback mechanism that keeps TRPV1-mediated
pain hypersensitivity in check.

RESULTS 

Transgenic Tag-TRPV1 Recapitulates Native TRPV1
Function 

Given the complex integrative function of the capsaicin receptor, 
we reasoned that analysis of TRPV1 accessory proteins would 
shed light on unrecognized modulation of the receptor. There- 
fore, we utilized a genetic-biochemical approach to identify 
components of TRPV1 protein complexes isolated from native 
sensory tissue of transgenic animals. 

To achieve efficient and specific biochemical purification of 
TRPV1 protein complexes for downstream analysis by mass 
spectrometry, we generated mice expressing an affinity-tagged 
Figure 1. Tag-TRPV1 Reproduces Native TRPV1 Expression and
Function

(A) Cartoon depicting Tag-TRPV1.

(B) Western blot of solubilized membrane fractions of DRG derived from WT,
Trpv1−/−, and Trpv1−/−;Tag-Trpv1 mice.

(C) Immunostaining of DRG derived from WT and Tag-Trpv1 mice with anti-
Flag (green) and anti-TRPV1 (red) antibodies reveals that Tag-TRPV1 re-
capitulates the expression profile of the native receptor.

(D) Tag-TRPV1 expression rescues capsaicin responses in Trpv1−/− mice.
Cultured DRG neurons obtained from WT, Trpv1−/−, and Trpv1−/−;Tag-Trpv1
mice were challenged with capsaicin and the proportion of responders as-
sessed by calcium imaging. Error bars represent SEM.

(E) Immunostaining of Tag-TRPV1 (anti-Flag antibody, green) and the pan-
neuronal marker PGP9.5 (red) of corneas derived from WT and Tag-Trpv1 mice
reveals that Tag-TRPV1 protein is transported to peripheral sensory endings.
Scale bars, 50 µm.

See also Figure S1.



version of TRPV1 (Tag-TRPV1, Figure 1A). Western blotting of
protein extracts derived from sensory ganglia (dorsal root
ganglia, DRG) of transgenic animals revealed the expected
size of the tagged receptor protein, which is slightly shifted to-
ward higher molecular weight compared to native (untagged)
TRPV1 (Figure 1B).

To ensure identification of physiologically relevant TRPV1 pro-
tein complex components, we first verified cell-type-specific
expression and functionality of the tagged receptor in BAC trans-
genic animals. Tag-TRPV1 expression recapitulated the native
profile in DRG (Figures 1C, S1A, and S1B). Accordingly, the
tagged receptor rescued capsaicin responses in DRG sensory
neurons derived from transgenic animals crossed onto the
TRPV1 knockout background (Figures 1D and S1C). Importantly,
the Tag did not impair trafficking of TRPV1 to peripheral and
central nerve terminals (Figures 1E and S1D).

Finally, we tested TRPV1-dependent nociceptive behavior.
Whereas we observed normal responses to heat stimuli in a
transgenic mouse line expressing close to physiological levels
of Tag-TRPV1, a second line, expressing high levels of the
tagged receptor (Figure S1E), exhibited significantly reduced
pain thresholds (Figure S1F). These findings not only prove the
functionality of the tagged receptor in vivo but also underscore
TRPV1’s predominance in mediating heat hyperalgesia.

Mass Spectrometry Reveals the GABAB1 Subunit as a
Constituent of TRPV1 Protein Complexes 

Next, utilizing the Strep or Flag moiety of the tagged receptor, we 
affinity purified protein complexes extracted from DRG of trans- 
genic animals expressing both the tagged and the native form of 
the receptor. Wild-type animals devoid of the tagged receptor 
served as control. 

Mass spectrometry revealed the presence of TRPV2 — a TRP 
family member previously shown to interact with TRPV1 (Rutter 
et al., 2005) — in purified protein complexes, further validating 
our approach. The GABAB1 receptor subunit was identified as
one of the most abundant components of native TRPV1 protein
complexes, regardless of Tag-TRPV1 expression level (Figures
2A and S2A and Table S1).

Interestingly, GABAB receptors in the CNS have long been
known to mediate analgesia (Sokal and Chapman, 2003). How-
ever, GABAB mechanisms specifically targeting the peripheral
nociceptive pathway have not been identified. Surprisingly,
GABAB receptors are robustly expressed in peripheral nocicep-
tive neurons (Charles et al., 2001; Towers et al., 2000), suggest-
ing an unrecognized role for GABAB in the pain pathway. We
were able to confirm these results and found that most, if not
all, TRPV1-positive sensory neurons express the GABAB1 sub-
unit (Figure 2B).

We validated the presence of GABAB1 in TRPV1 protein com-
plexes by immunoprecipitation (IP) of Tag-TRPV1 or GABAB1
(Figures 2C and 2D).

To further confirm the close proximity of the two transmem- 
brane receptors in their native environment, we carried out prox- 
imity ligation assays (PLA). In this assay, a fluorescent signal is 
generated when two proteins of interest either physically interact 
directly or coexist within close molecular distance (Soderberg 
et al., 2006). We therefore cultured DRG neurons of Tag-Trpv1
Figure 2. GABAB1 and TRPV1 Reside in Common Protein Complexes

(A) Mass spectrometric analysis and quantification of Flag-affinity-purified
Tag-TRPV1 protein complexes derived from DRG of mice expressing either
high or low levels of Tag-TRPV1. Dot plot shows significantly enriched proteins
as log₂ fold ratio (Tag-TRPV1/WT). The analysis reveals specific and robust
enrichment for GABAB1a protein.

(B) Immunostaining of DRG of WT mice with anti-TRPV1 (red) and anti-GABAB1
(green) antibodies shows co-expression of both receptors in nociceptive
neurons. Scale bar, 50 µm.

(C and D) Immunoprecipitations of solubilized protein extracts derived from
DRG of Tag-Trpv1 or WT control animals using immobilized anti-Flag antisera
(C), anti-GABAB1, or control antisera (D). Western blots are probed with anti-
GABAB1 (bottom) or anti-TRPV1 (top panels in C) or anti-Flag antisera (top
panels in D). GABAB1 only co-elutes from affinity resin in the presence, but not
in the absence, of TRPV1 (bottom right panel in C). Note that DRG neurons
predominantly express the GABAB1a isoform and little or no GABAB1b. Simi-
larly, TRPV1 is specifically enriched in isolated GABAB1 protein complexes (top
right panel in D). Note that endogenous (untagged) TRPV1 protein is precipi-
tated together with Tag-TRPV1, confirming the biochemical isolation of native
tetrameric TRPV1 protein complexes (top right panel in C).

(E) PLA of dissociated DRG neurons of Tag-Trpv1 and WT animals. Antibodies
against Flag and GABAB1 were used to detect proximity of the two receptors.
PLA signal (red) is only present in cell bodies and neurites derived from
Tag-Trpv1 animals. Scale bar, 25 µm.

(F) BRET assay using Rluc-TRPV1 and GABAB1-YFP fusion proteins in the
presence or absence of GABAB2, showing that TRPV1 and GABAB1 strongly
interact and that GABAB2 competes with TRPV1 for GABAB1 binding. Error
bars represent SEM.

(G) No interaction was detected between Rluc-TRPV1 and GABAB2-YFP. Error
bars represent SEM.

See also Figure S2 and Table S1.



mice and wild-type control animals and performed PLA using
anti-Flag and anti-GABAB1 antibodies. A robust PLA signal was
detected in Tag-Trpv1 neurons, but not in WT controls, demon-
strating that a TRPV1-GABAB1 complex is present in intact sen-
sory neurons and their projecting neurites (Figure 2E).

Intriguingly, we were not able to detect the GABAB2 subunit
either by mass spectrometry or by western blotting of TRPV1
protein complexes (Figures S2B and S2C). Similarly, KCTD pro-
teins, found to constitutively interact with the GABAB2 receptor
subunit (Schwenk et al., 2010), were absent, and only GABAB1
selectively co-purified with TRPV1 complexes.

Additionally, when co-expressing TRPV1, GABAB1, and
GABAB2, lower amounts of GABAB1 protein were detected in
TRPV1 IPs when compared to samples containing TRPV1 and
GABAB1 alone (Figures S2D and S2E). These findings suggested
that TRPV1 and GABAB2 might compete for binding to GABAB1.
To further examine this possibility, we carried out biolumines-
cence resonance energy transfer (BRET) assays (Ayoub and
Pfleger, 2010) and expressed Renilla Luciferase BRET donor
(Rluc-TRPV1) and YFP BRET acceptor (GABAB1-YFP, GABAB2-
YFP, or YFP-TRPV1) fusion proteins in a heterologous cell
expression system. While GABAB BRET probes have been
characterized previously (Adelfinger et al., 2014), we first
confirmed the functionality of the TRPV1 BRET probes (Figures
S2F and S2G).

The BRET assay confirmed close association of TRPV1 and
GABAB1 (Figure 2F). Again, no interaction was detectable be-
tween TRPV1 and GABAB2 (Figure 2G). Moreover, similar to
the co-IP experiments (Figures S2D and S2E), co-expression
Figure 3. Effect of Baclofen on Native TRPV1 Receptors in Cultured
Sensory Neurons

(A) Electrophysiological recordings of cultured DRG neurons. TRPV1 currents
elicited by capsaicin (500 nM) were not affected by application of 100 µM
baclofen. Rise time (10%-90%, in s): control, 6.5 ± 1.8; Baclofen, 5.8 ± 1.3.
Decay (tau, s): control, 7.6 ± 1.5; Baclofen, 6.8 ± 1.8.

(B) Ca²⁺ responses (measured as normalized fluorescence ratios) of DRG
neurons stimulated repetitively with capsaicin (100 nM). NGF (100 ng/ml)
sensitized TRPV1 activity resulting in a larger response magnitude. Co-
application of baclofen (100 µM) with NGF resulted in inhibition of TRPV1
sensitization. 

(C) Top: simplified cartoon showing the release of inflammatory mediators 
during tissue injury. Components of the “inflammatory soup” activate diverse 
receptor types, which signal via PKC, PKA, and lipid modifiers, such as PLC, to 
mediate TRPV1 sensitization. Bottom: quantification of averaged fluorescence 



of GABAB2 interfered with the interaction of Rluc-TRPV1 and
GABAB1-YFP (Figure 2F), shown by the increased amount of
BRET acceptor (GABAB1-YFP) required to reach 50% of the
maximal BRET signal (Figures 2F, S2H, and S2I).
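
One way to picture the "acceptor amount required to reach 50% of the maximal BRET signal" is as the half-saturation constant of a saturation curve. The sketch below assumes a simple one-site hyperbolic model, BRET = BRETmax × x / (BRET50 + x), and estimates both parameters with a Hanes-Woolf linearization so that only the standard library is needed. The text does not state how the authors fitted their curves, so the model, the function name `fit_bret_saturation`, and the synthetic data are all assumptions; nonlinear regression would normally be preferred for noisy data.

```python
def fit_bret_saturation(acceptor, bret):
    """Estimate (bret_max, bret50) for BRET = bret_max * x / (bret50 + x).

    Uses the Hanes-Woolf form x / y = x / bret_max + bret50 / bret_max,
    fitted by ordinary least squares of x / y against x.
    """
    points = [(x, x / y) for x, y in zip(acceptor, bret) if x > 0 and y > 0]
    if len(points) < 2:
        raise ValueError("need at least two positive data points")
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_z = sum(z for _, z in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0:
        raise ValueError("acceptor amounts must not all be identical")
    slope = sum((x - mean_x) * (z - mean_z) for x, z in points) / sxx
    if slope <= 0:
        raise ValueError("data do not look like a saturating curve")
    intercept = mean_z - slope * mean_x
    return 1.0 / slope, intercept / slope


if __name__ == "__main__":
    amounts = [0.5, 1.0, 2.0, 4.0, 8.0]  # hypothetical acceptor/donor ratios
    # Synthetic curves: same maximum, different half-saturation points.
    without_competitor = [0.4 * x / (2.0 + x) for x in amounts]
    with_competitor = [0.4 * x / (6.0 + x) for x in amounts]
    print(fit_bret_saturation(amounts, without_competitor))
    print(fit_bret_saturation(amounts, with_competitor))
```

In this picture, a competitor that raises the fitted half-saturation value without changing the maximum corresponds to the rightward shift described for GABAB2 co-expression.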

In summary, GABAB1 and TRPV1 form a protein complex that
appears to lack the G protein signaling subunit GABAB2.

GABAB Signaling Counteracts Sensitization of TRPV1

Given GABAB’s established role in modulating KIR3 and CaV
channels (Padgett and Slesinger, 2010), it is conceivable that
GABAB’s known analgesic property could stem in part from
inhibiting TRPV1 activity.

In contrast to its effect on KIR3 channels (Figure S3A), neither
calcium imaging nor electrophysiological recordings revealed
any influence of the GABAB agonist baclofen on TRPV1 currents
elicited by capsaicin (Figures 3A and S3B). Tachyphylaxis, a
form of rapid TRPV1 desensitization, was also not affected by
baclofen application (Figure S3C).

Under inflammatory conditions, multiple pathways sensitize
TRPV1. Paramount among the different inflammatory sensitizers
is NGF, activating parallel TRKA receptor signaling cas-
cades that converge on the capsaicin receptor to enhance its
sensitivity.

We thus tested whether NGF-mediated TRPV1 sensitization
is modulated by GABAB activation. To this end, we monitored
nociceptive neuron populations for NGF-enhanced TRPV1 activ-
ity using calcium imaging (Bonnington and McNaughton, 2003).

Indeed, pre-incubation of sensory neurons with baclofen (but 
not saline) robustly blocked TRPV1 sensitization (Figures 3B, 
3C, and S3D). 

Apart from the receptor tyrosine kinase TRKA, several G-pro-
tein-coupled receptors (GPCRs) mediate TRPV1 sensitization
(Figure 3C). Notorious in this regard are bradykinin- and seroto-
nin-triggered GPCR cascades (Chuang et al., 2001; Huang et al.,
2006). Strikingly, baclofen was equally effective in inhibiting
TRPV1 sensitization induced by both inflammatory mediators
(Figures 3C, S3E, and S3F).

Given that the initial signaling events controlled by the two
GPCRs are quite different from those of TRKA, these results suggest
that GABAB exerts its effect at a converging point downstream of
the different pathways, potentially at TRPV1 itself. In agreement
with this hypothesis, bypassing upstream inflammatory receptor
signaling by direct pharmacological PKC activation also resulted
in baclofen-reversible TRPV1 sensitization (Figure 3C).

The inflammatory prostaglandin PGE₂ mediates TRPV1 sensi-
tization largely via a Gs-coupled/PKA pathway (Gu et al., 2003;
Lopshire and Nicol, 1998; Moriyama et al., 2005). Intriguingly,
baclofen did not inhibit PGE₂-mediated TRPV1 sensitization (Fig-
ure 3C). Canonical GABAB-Gi/o coupling would be expected to
effectively counteract Gs-mediated sensitization. In agreement



ratios as shown in (B). Bar graphs represent the ratios between peaks 6 and 5 
(after and before application of inflammatory agents) in the presence or 
absence of baclofen (100 µM). Baclofen effectively attenuated TRPV1 sensi-
tization induced by NGF (100 ng/ml), serotonin (100 µM), bradykinin (10 nM),
and PMA (1 µM), but not by PGE₂ (1 µM). Cells treated with vehicle or baclofen
served as a negative control. Error bars represent SEM. 

See also Figure S3. 
Figure 4. Reversal of TRPV1 Sensitization by GABAB Is Recapitulated in Cellular Expression Systems In Vitro

(A) Calcium responses elicited by 50 nM capsaicin pulses were assessed in TRPV1-HEK293 cells with or without GABAB after incubation with either baclofen
(100 µM), PMA (1 µM), or the combination of PMA+baclofen as indicated. Depicted are representative ratiometric traces from one experiment (n = 50 cells). PMA
promotes PKC-induced TRPV1 sensitization, and baclofen inhibits sensitization only in the presence of GABAB. Error bars represent SEM.

(B) Quantification of experiments shown in (A). PTX (500 ng/ml) did not abrogate the baclofen effect, indicating that Gi/o proteins are not involved in counteracting
TRPV1 sensitization by GABAB.

(C and D) Electrophysiology recordings of oocytes injected with Trpv1, TrkA, and GabaB, following TRPV1 activation with pH 5.8, before and after treatment with
NGF (100 ng/ml) (C) and NGF + baclofen (D).

(E) Quantification of the experiments shown in (C) and (D); sensitization was measured as log₂ ratio of capsaicin-elicited steady-state currents before and after
drug treatment. TRPV1 sensitization mediated by NGF is attenuated by baclofen only in the presence of GABAB. Error bars represent SEM.

See also Figure S4. 
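
The two sensitization readouts named in the figure legends (the ratio between response peaks 6 and 5 for calcium imaging, and the log₂ ratio of steady-state currents before and after drug treatment for oocyte recordings) reduce to simple arithmetic. The sketch below is illustrative only: the function names and example values are hypothetical, and it assumes that peak amplitudes and currents have already been extracted from the raw traces and that the ratio is taken as after/before.

```python
import math


def peak_ratio(peak_amplitudes, before=5, after=6):
    """Ratio of the response at peak `after` to that at peak `before` (1-based)."""
    if len(peak_amplitudes) < max(before, after):
        raise ValueError("not enough peaks in the trace")
    reference = peak_amplitudes[before - 1]
    if reference <= 0:
        raise ValueError("reference peak must be positive")
    return peak_amplitudes[after - 1] / reference


def log2_current_ratio(current_before, current_after):
    """log2(after / before); 0 means no change, 1 means a doubling."""
    if current_before <= 0 or current_after <= 0:
        raise ValueError("current magnitudes must be positive")
    return math.log2(current_after / current_before)


if __name__ == "__main__":
    # Hypothetical peak amplitudes for six capsaicin pulses; the sensitizing
    # agent is applied between pulses 5 and 6.
    sensitized = [1.0, 0.9, 0.8, 0.8, 0.8, 1.6]
    blocked = [1.0, 0.9, 0.8, 0.8, 0.8, 0.8]
    print(peak_ratio(sensitized), peak_ratio(blocked))
    print(log2_current_ratio(100.0, 400.0))
```

A ratio near 1 (or a log₂ ratio near 0) indicates no sensitization, which is the pattern the legends describe for baclofen-treated cells expressing GABAB.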



with the apparent absence of the GABAB2 subunit from TRPV1-
GABAB1 complexes, this result further pointed at a potentially un-
recognized GABAB mechanism modulating TRPV1 sensitization
independent of Gi/o signaling.

Non-Canonical GABAB Signaling Counteracts TRPV1
Sensitization

To test directly whether Gi/o protein is involved in mediating the
observed GABAB effect, we pre-incubated sensory neurons
with pertussis toxin (PTX). This selective and potent blocker of
Gi/o proteins did not attenuate baclofen’s effect on TRPV1 sensi-
tization (Figures S4A and S4B), whereas another Gi/o-signaling
cascade present in DRG neurons was effectively inhibited
by PTX (Figures S4C and S4D), thereby serving as a positive
control.

The positive allosteric modulator CGP7930 potentiates
GABAB receptor signaling by binding to the GABAB2 subunit
(Binet et al., 2004). However, inhibition of TRPV1 sensitization
was not significantly enhanced by CGP7930 (Figure S4E) at
concentrations that robustly potentiated GABAB-induced KIR3
currents (Figure S4F), again pointing toward a non-canonical
GABAB pathway.

Next, we tested GABAB’s modulatory effect on TRPV1 sensi-
tization in two heterologous expression systems.



We first recapitulated PKC-induced TRPV1 sensitization in a
TRPV1-HEK293 cell line (Siemens et al., 2006). Pre-incubating
the cells with baclofen in the absence of GABAB had no effect
and did not abrogate PKC-mediated TRPV1 sensitization. How-
ever, co-expressing GABAB completely reverted TRPV1 hyper-
activity upon baclofen incubation (Figures 4A and 4B). Similar
to our results in native sensory neurons, PTX did not prevent bac-
lofen from inhibiting TRPV1 sensitization (Figure 4B).

Additionally, we reconstituted the NGF sensitization pathway
and expressed the TRKA receptor together with TRPV1 and
GABAB in Xenopus oocytes. Again, we found that baclofen
was able to significantly reduce NGF sensitization in a GABAB-
dependent manner (Figures 4C-4E).

In agreement with the observed close molecular proximity of
the two receptors, these results demonstrate a direct inhibitory
effect of GABAB receptors on TRPV1 sensitization, independent
of classical Gi/o signaling downstream of GABAB2.

GABAB1 Reverts TRPV1 Sensitization Independent of
GABAB2 Signaling

In order to examine the role of the two GABAB receptor subunits
in more detail, we next employed GABAB receptor knockout
mice. A conditional “floxed” GABAB1 mouse line (GabaB1f/f; Hal-
ler et al., 2004) was crossed to Trpv1-Cre mice (Cavanaugh et al.,
Figure 5. Surface Expression of GABAB1
in the Absence of GABAB2 Is Sufficient to
Mediate Inhibition of TRPV1 Sensitization

(A) Quantification of capsaicin responses in DRG
neurons from Trpv1-Cre;GabaB1f/f (GabaB1−/−),
GabaB2−/−, and WT control mice in the presence
or absence of baclofen (100 µM) following incu-
bation with serotonin (100 µM). Error bars repre-
sent SEM.

(B) Cell surface expression of GABAB1 and
GABAB1-ASAA in the presence or absence of
GABAB2 or TRPV1 was assessed by measuring
the binding of [³H]CGP54626 in intact CHO cells.
Unspecific binding to non-transfected (NT) cells is
set to 1. Error bars represent SEM.

(C) BRET experiment demonstrating that RLuc-
TRPV1 and surface-localized GABAB1-ASAA-YFP
fusion proteins interact. Error bars represent SEM.

(D) Calcium responses in TRPV1-HEK293 cells
transfected with GABAB1 or GABAB1-ASAA after
incubation with either baclofen (100 µM), PMA
(1 µM), or the combination of PMA+baclofen. Error
bars represent SEM.

(E) Quantification of serotonin-sensitized capsa-
icin responses of cultured GabaB2−/− DRG
neurons transfected with a fluorescent reporter
(tomato) in the presence or absence of
GABAB1-ASAA plasmid, incubated with baclofen
(100 nM). Error bars represent SEM.

(F) Representative western blot showing phos-
phorylated (pV1) and total TRPV1 protein (V1)
of HEK cells transfected with control plasmid,
GABAB1+B2, or GABAB1-ASAA alone after treatment
with PMA, PMA+baclofen, or vehicle. Three bio-
logical replicates are loaded onto the gel, but only
one representative lane for each condition is shown.

(G) Quantification of (F). Phosphorylated TRPV1
intensity values were normalized to total TRPV1
intensities and expressed as percent of baclofen-
mediated inhibition (PMA+baclofen/PMA). Error
bars represent SEM.

See also Figure S5. 
2011) to specifically ablate the GABAB1 subunit in TRPV1-posi-
tive sensory neurons (Figure S5A). As expected, inhibition of
TRPV1 sensitization by baclofen was absent in sensory neurons
derived from conditional Trpv1-Cre;GabaB1f/f mice, attesting to
baclofen’s high specificity and selectivity for the GABAB1 subunit
(Figure 5A). In DRG neurons derived from GabaB2−/− mice (Gass-
mann et al., 2004), the baclofen effect was also abrogated (Fig-
ure 5A), demonstrating that the GABAB2
subunit is required for mediating TRPV1
modulation, despite its apparent absence
from TRPV1 protein complexes and the
dispensability of Gi/o signaling down-
stream of the GABAB2 subunit.

It is well established that GABAB1 does
not translocate to the cell surface
autonomously, but the GABAB2 subunit
mediates trafficking of the heteromeric
receptor complex by masking an ER
retention signal present in the GABAB1 protein (Margeta-Mitrovic
et al., 2000; Pagano et al., 2001). We therefore asked whether
TRPV1 promotes cell surface translocation of GABAB1 indepen-
dent of the GABAB2 subunit. Analyzing cell surface binding of a
GABAB1-specific radioligand [³H]CGP54626 (Galvez et al.,
2001), we find that TRPV1, unlike GABAB2, does not pro-
mote cell surface translocation of GABAB1 (Figure 5B).
Mutating the ER-retention signal encoded by the peptide
sequence “RSRR” to “ASAA” allows surface translocation of
GABAB1 independent of GABAB2 without changing GABAB1’s
signaling capability (Margeta-Mitrovic et al., 2000; Pagano
et al., 2001). Interestingly, this constitutively surface-localized
GABAB1 variant, GABAB1-ASAA (Figure 5B), interacts with TRPV1
similar to its wild-type counterpart (Figure 5C), suggesting that
the GABAB1-TRPV1 protein complex can form at the cell surface.

As expected, ER-retained native GABAB1 protein was unable
to mediate inhibition of TRPV1 sensitization when ex-
pressed in the absence of the GABAB2 subunit (Figure 5D). Strik-
ingly, surface-localized GABAB1-ASAA alone robustly inhibited
TRPV1 sensitization in HEK293 cells (Figure 5D). Moreover,
GABAB1-ASAA reinstated baclofen-mediated inhibition of TRPV1
sensitization in GABAB2-deficient sensory neurons (Figures 5E,
S5B, and S5C), demonstrating the ability of GABAB1 to signal
to TRPV1 directly and independently of the GABAB2 subunit.

PKC phosphorylation of TRPV1 sensitizes the receptor (Man-
dadi et al., 2006) and is a convergence point downstream
of NGF, bradykinin, and serotonin signaling. We therefore
examined whether PMA-induced PKC phosphorylation of
TRPV1 is attenuated by GABAB1 signaling. Indeed, we find that
GABAB1-ASAA alone (similar to the native GABAB1+B2 receptor
pair) effectively inhibits PKC phosphorylation of TRPV1 (Figures
5F and 5G).

GABA b Reverts TRPV1 Hypersensitivity in Models of 
Inflammatory and Neuropathic Pain 

Given the robust cellular effects of GABA B1 on TRPV1 sensitization,
we next tested whether these results translate into GABA B1-mediated
inhibition of pain hypersensitivity in animal models.
One nocifensive assay that allows robust measurement of
TRPV1-dependent heat hyperalgesia is the Hargreaves assay.
In this assay, inflammatory stimuli, such as NGF and bradykinin,
decrease the paw withdrawal latency in a TRPV1-dependent
manner (Chuang et al., 2001). Indeed, we observed significant
inhibition of sensitization in baclofen-injected mice compared
to mice that solely received NGF. No effects were detected in
Trpv1-/- animals (Figure S6A). To rule out any potential central
effects of baclofen (Sokal and Chapman, 2003), we slightly modified
the testing paradigm and induced inflammatory hyperalgesia
in both hind paws of the same animal (Figure S6B). We found
robust and dose-dependent inhibition of sensitization only in the
ipsilateral paw that had received baclofen, but not in the contralateral
paw (Figures 6A and S6C). Moreover, baclofen’s inhibitory
effect was completely blunted in conditional Trpv1-Cre;Gaba B1 f/f
mice (Figure 6B). These findings are in agreement with our
cellular data (Figure 5A) and demonstrate that GABA B1’s beneficial
effect on heat hyperalgesia is mediated in peripheral TRPV1-positive
nociceptors in a cell-autonomous fashion, rather than in
higher-order pain-processing CNS areas.

Similarly, GABA B1 receptor activation strongly attenuated serotonin-
and bradykinin-triggered heat hyperalgesia but spared
PGE2-triggered heat hyperalgesia (Figures 6A and S6D), an effect
that was not further enhanced by the positive allosteric
modulator CGP7930 that stimulates GABA B2 (Figure S6E).

For therapeutic purposes, it would be beneficial to identify an
intervention that is able to revert pre-existing pain hypersensitivity.
Applying baclofen at a time point when NGF had already
sensitized nociceptors reduced heat withdrawal thresholds
(Figure S6F), establishing that GABA B1 activation has the capacity
for both prevention and reversal of capsaicin receptor
sensitization.

Given the ability of the GABA B1 subunit to inhibit sensitization
triggered by multiple different inflammatory pathways, we 
asked whether it is also effective in attenuating hyperalgesia 
as a consequence of intraplantar CFA (complete Freund’s adju- 
vant) injection, a widely used model of persistent inflammatory 
pain. We found that intraplantar application of baclofen 
2 days after inducing CFA-mediated inflammation significantly 
reduced heat hyperalgesia compared to vehicle-treated con- 
trols (Figure 6C). The beneficial effect of a single dose lasted 
for a minimum of 3 hr and had completely ceased 24 hr after 
application. 

Similar to the cellular assays, our behavior experiments
confirm that GABA B1 activation specifically attenuates hypersensitivity
of TRPV1 but does not impinge upon acute responsiveness
of the receptor to either a heat stimulus, as tested in the
Hargreaves assay (Figure S6G), or capsaicin, as assessed by measuring
acute capsaicin-induced nocifensive responses (Figure S6H).

GABA B1 and TRPV1 Co-Localize at Peripheral Nerve
Endings 

Our data suggest that GABA B1 receptors mediate their anti-hyperalgesic
effect in the periphery. Consequently, GABA B1 would
be expected to reside in free nerve endings in close proximity to
TRPV1. To elucidate whether the two receptors indeed co-localize
at nociceptor endings, we utilized transgenic Gaba B1-GFP
reporter mice (Casanova et al., 2009). Similar to skin, the
cornea is densely innervated by TRPV1-positive sensory fibers,
consistent with the strong reactions of the eye to inflammatory insults.
We analyzed both skin and cornea tissue for co-expression of
the two receptors in peripheral nerve endings that project from
DRG and trigeminal (TG) sensory neurons, respectively. Not
only did we find GABA B1 to be present in peripheral nociceptive
fibers, but we also observed substantial co-localization with
TRPV1 in nerve terminals (Figures 6D-6F and S6I-S6K and
Movie S1). In agreement with the close association of TRPV1
and GABA B1 found in Tag-TRPV1-positive DRG neurons (Figure 2E),
89% of TRPV1-positive corneal fibers co-localize the
two receptors at their terminals (Figure S6L).

GABA Is Released from Peripheral Nociceptors and 
Modulates Pain Hypersensitivity 

Interestingly, we found that local application of the competitive
GABA B1 receptor antagonist CGP52432 enhanced NGF-mediated
TRPV1 sensitization and induced a small but significant
increase in mouse pain behavior compared to NGF alone (Figure 7A).
This result suggested that GABA, the native agonist
of GABA A and GABA B receptors, is endogenously present at
peripheral nerve endings to produce a basal GABA B receptor
tone that regulates TRPV1 sensitivity. Indeed, in blister fluid obtained
from human skin and extracted mouse corneal fluid, we
find GABA at concentrations (161 ± 42 nM and 335 ± 54 nM,
respectively) sufficient to inhibit TRPV1 sensitization in cultured
DRG neurons in a GABA B1-dependent manner (Figure 7B).

Figure 6. GABA B1 Activation in Peripheral Nociceptive Terminals Decreases Thermal Hyperalgesia

(A) Thermal responses were measured using the Hargreaves test. NGF (2 µg/paw), serotonin (5HT, 100 nmol/paw), bradykinin (10 nmol/paw), or PGE2 (1 nmol/paw)
were injected into both mouse hind paws 30 min after unilateral administration of baclofen (Bac, 3 µg/paw) into the ipsilateral paw and vehicle (Ctr) into the
contralateral paw. Bar graph shows the differences in mean paw withdrawal latency upon radiant heat stimulation relative to the basal (non-stimulated) withdrawal
latency. Baclofen decreased thermal hyperalgesia for all inflammatory mediators, except PGE2. Basal withdrawal latencies were measured 24 hr before
the experiment. Error bars represent SEM. 

(B) Baclofen does not attenuate NGF-induced thermal hyperalgesia in conditional Trpv1-Cre;Gaba B1 f/f mice (Gaba B1-/-). Error bars represent SEM.

(C) Heat hyperalgesia 2 days after CFA induction is attenuated by baclofen (3 µg/paw) for a minimum of 3 hr when compared to mice that received vehicle only
(saline). Error bars represent SEM. 

(D) Cartoon depicting skin or cornea innervation shown in (E) and (F). 

(E) Immunostaining of skin and cornea sections obtained from Gaba B1-GFP transgenic mice and labeled with antisera for TRPV1 (red), GFP (green), and Tuj1
(blue). Scale bars, 20 µm.

(F) Higher magnification of a single corneal fiber. GABA B1-GFP localizes in TRPV1-positive fibers and is concentrated at the terminal close to the epithelial surface
(demonstrated by orthogonal maximal projections in the x, y, and z dimensions). Scale bar, 10 µm.

See also Figure S6 and Movie S1.

Figure 7. Peripheral Effects and Localization of GABA and Its
Release from Peripheral Fibers 

(A) Withdrawal latencies in mice sensitized with NGF are significantly reduced
by the application of the GABA B1 inhibitor CGP52432 (17 nmol/paw) compared
to mice that received vehicle. Error bars represent SEM.

(B) 200 nM GABA is sufficient to attenuate serotonin sensitization of TRPV1 in
WT but not in Trpv1-Cre;Gaba B1 f/f (Gaba B1-/-) sensory neurons. Error bars
represent SEM.

(C) Cartoon depicting cornea innervation shown in (D); nerve fibers emerge 
from the sub-basal nerve plexus and branch out toward the corneal surface. 

(D) Sensory fibers innervating the mouse cornea were immunolabeled with
antisera against GABA (red) and TRPV1 (green). GABA is either distributed
throughout the length of the fibers (left) or concentrated at the terminals close
to the corneal surface (middle) of TRPV1-positive fibers. Although most fibers
contain both GABA and TRPV1, some fibers harbor GABA in the absence
of TRPV1 (right). Scale bar, 10 µm.

(E) Corneas of WT mice were treated with either capsaicin (10 µM) or vehicle
(10 min, 32°C) and subsequently prepared for immunostaining using antisera
recognizing GABA and Tuj1. GABA staining was strongly reduced in capsaicin-treated
corneas compared to control, indicating that TRPV1 activation
stimulates GABA release. Scale bar, 50 µm.

(F) Quantification of (E). Error bars represent SEM. 

(G) Model illustrating inhibition of TRPV1 sensitization by GABA B1 receptor 
signaling. TRPV1 is sensitized by a multitude of signaling cascades that are 
initiated in the context of inflammation. Several cascades converge on PKC, a 
kinase shown to robustly mediate TRPV1 sensitization via direct receptor 
phosphorylation. Sensitization augments TRPV1 channel activity, which leads 
to increased calcium influx and concomitant GABA release to attenuate 
TRPV1 sensitization via GABA B1 . Activation of this autocrine loop dampens 
(or resolves) hyperactivity of TRPV1.

See also Figure S7. 

Although GABA is well known for its presence in inhibitory 
CNS neurons, much less is known about its localization in the 
periphery. Using a GABA-specific antibody, we found GABA 
localized to the terminal endings of corneal nociceptors, many 
of which also express TRPV1 (Figures 7C, 7D, and S7A). Given 
that primary afferent nociceptors are glutamatergic, it was unex- 
pected to find GABA at their peripheral endings. Presumably, in 
these fibers, GABA is stored in vesicles as many of the terminals 
also reacted with antibodies for vGAT, the vesicular GABA trans- 
porter (Figure S7B). Additionally, DRG and TG sensory neurons 
exhibit low Gad2 transcript levels (Figure S7C). Consistently, 
we find GABA to be present in DRG and TG neuronal cell bodies 
(Figure S7D), although the GABA labeling was less pronounced
than that observed in TRPV1-positive sensory terminals
(Figure 7D), likely reflecting efficient vesicular transport to distal
terminals.

Given the juxtaposition of GABA vesicles and TRPV1 at 
peripheral nerve terminals, we tested whether TRPV1 activation 
and subsequent calcium influx would constitute an adequate 
stimulus to release GABA. Stimulating isolated corneas with 
capsaicin reduces GABA content at the terminals compared to 
corneas kept without capsaicin, strongly suggesting that 
GABA release has taken place (Figures 7E and 7F). The capsaicin-mediated
release of GABA is specific to TRPV1 activation,
as it was absent from capsaicin-stimulated corneas of Trpv1-/-
mice (Figure S7E).

Taken together, these results support a model by which peripheral
GABA limits (and resolves) TRPV1 sensitization via
GABA B1 receptors, directly at the site where painful stimuli are
first encountered (Figure 7G).

DISCUSSION

Genetic-Proteomic Dissection of TRPV1 Modulation 

Counteracting sensitization of TRPV1, without blocking acute
channel activity, is a promising approach for developing
analgesics. Using infusible peptides mimicking intracellular
TRPV1 domains, it has been demonstrated that interference
with TRPV1 sensitization has beneficial effects in models of inflammatory
pain (Fischer et al., 2013). However, in contrast to
the multitude of pathways sensitizing TRPV1, endogenous pathways
inhibiting sensitization have remained elusive.

Combining a genetic-biochemical approach with quantitative 
mass spectrometry to probe the molecular environment of 
native TRPV1 receptors, we report here the identification of the 
GABA B1 receptor as a modulator of TRPV1 sensitization.

Non-Canonical GABA B Signaling Aborts TRPV1
Sensitization 

In CNS neurons, most, if not all, GABA B receptor effects are
mediated by coupling to inhibitory Gi/o-type G proteins (Padgett
and Slesinger, 2010). Activating Gi/o-coupled signaling in sensory
neurons is able to promote TRPV1 sensitization (Forster
et al., 2009; Loo et al., 2012). Hence, it is difficult to reconcile
the observed inhibitory GABA B effect on TRPV1 sensitization
with a classical Gi/o-coupled signaling cascade.

Multiple lines of evidence suggest that a GABA B1 mechanism
that is independent of GABA B2 signaling targets TRPV1 hypersensitivity:
(1) GABA B2 appears to be absent from GABA B1-TRPV1
complexes; (2) neither PTX nor CGP7930, which inhibit Gi/o activation
and enhance G protein coupling of GABA B2, respectively,
had any effect on GABA B’s potential to attenuate TRPV1 sensitization;
and (3) plasma-membrane-localized GABA B1 can inhibit
TRPV1 sensitization in the absence of GABA B2.

Our data suggest that a ternary complex encompassing TRPV1
and both GABA B subunits is not favored; rather, GABA B2 competes
with TRPV1 for GABA B1 interaction. Interestingly, it has
been shown that TRPV1 activation by capsaicin can promote
dissociation of the two GABA B subunits (Laffray et al., 2007).
Thus, it is tempting to speculate that TRPV1 activation promotes
the formation of a surface-localized TRPV1-GABA B1 liaison at the
expense of a heteromeric GABA B1-GABA B2 receptor complex,
thereby priming the nociceptor to guard against sensitization.

GABA B Inhibits Convergent PKC-Mediated TRPV1
Sensitization 

How does the GABA B1 receptor mediate its effect on TRPV1?
Given the close proximity of the two proteins, it is possible that
baclofen-induced conformational changes are directly transmitted
onto the TRP ion channel to control and regulate its sensitization
status. This would be reminiscent of dopamine receptors,
which directly modulate GABA A-type ion channels, independent
of a downstream signaling cascade (Liu et al., 2000).

Alternatively, a putative signaling effector downstream of
GABA B1 may inhibit TRPV1 sensitization. Signaling cascades
downstream of GABA B1 (and independent of GABA B2) have
remained elusive. However, β-arrestin2 has been implicated
to act downstream of GABA B receptors, independent of Gi/o
protein signaling (Lu et al., 2012). β-arrestin2 has also been
proposed to modulate tachyphylactic TRPV1 responses (Por
et al., 2012), suggesting that this protein could, in principle,
couple GABA B receptors to TRPV1. However, we did not detect
β-arrestin2 in TRPV1-GABA B1 protein complexes, nor did we
find tachyphylactic TRPV1 responses to be modulated by
GABA B receptor activation (Figure S3).

In contrast to the important functions of β-arrestins in other
GPCR signaling cascades, their role in GABA B-mediated
pathways is less clear (Perroy et al., 2003; Sudo et al., 2012),
suggesting that β-arrestin2 is an unlikely candidate to mediate
the modulatory action of GABA B1 on TRPV1.

Unlike sensitization by the inflammatory pathways initiated by NGF,
serotonin, or bradykinin, PGE2-triggered sensitization
of TRPV1 was not inhibited by the GABA B1 receptor. The differential
susceptibility to GABA B1 receptor modulation coincides
with different points of signal convergence on TRPV1; a common
denominator of NGF, serotonin, and bradykinin signaling is the
activation of PLC/PKC pathways that result in sensitization of
the capsaicin receptor (Huang et al., 2006). By triggering PKC
activation directly, we find that this branch of TRPV1 sensitization
can be inhibited by GABA B1 receptor activity, resulting in
reduced TRPV1 phosphorylation.

In contrast, PGE2 signaling leads to PKA-dependent phosphorylation
of TRPV1 (Gu et al., 2003; Lopshire and Nicol,
1998; Moriyama et al., 2005), and its sensitizing effect was not
attenuated by GABA B1. Phosphorylating TRPV1 at different sites
may have different functional consequences despite a similar net
effect on sensitization: recent studies demonstrate unique
structural features of TRPV1 with an upper and lower gate that
are independently engaged by different pro-algesic agents to
open and modulate ion channel conductance (Cao et al.,
2013). Thus, it is conceivable that differential phosphorylation
by PKC and PKA mediates different types of TRPV1 gating, the
former of which is susceptible to GABA B1 receptor-mediated inhibition,
whereas the latter is not.

Given that PKC is not the sole mediator of TRPV1 sensitization
utilized by the inflammatory pathways susceptible to GABA B1
inhibition, it is quite possible that other mechanisms are
also targeted and that GABA B provides a broader protection
(or “shielding”) against TRPV1 sensitization.

GABA B receptors trigger a surprising range of different cellular
responses. Heteromeric receptor assembly, as well as the recent
discovery of auxiliary GABA B subunits (Schwenk et al., 2010),
has been proposed to explain contextual signaling diversity
and challenge the classical view of GPCR organization and function.
Our study further corroborates GABA B receptor signaling
complexity and highlights a GABA B2-independent pathway that
mitigates TRPV1-mediated pain hypersensitivity.

Peripheral GABA Regulates Nociceptor Sensitization 

It came as a surprise to find not only GABA B receptors but also
their cognate agonist, GABA, localized at peripheral nerve termi- 
nals. Classically, vesicular GABA is a hallmark of inhibitory syn- 
apses of CNS neurons. Although it is known that the excitatory 
transmitter glutamate is present in peripheral nerves, where 
it may promote excitability and neurogenic inflammation (Miller 
et al., 2011), to our knowledge nothing similar has been reported 
for inhibitory transmitters. 



768 Cell 160 , 759-770, February 12, 2015 ©2015 Elsevier Inc. 







Here, we provide evidence that, under physiological conditions,
peripheral GABA limits TRPV1-mediated hyperalgesia. It
will be interesting to ascertain whether the peripheral GABA-GABA B1
feedback on TRPV1 is altered under pathological pain
conditions. Exploiting this endogenous feedback system (e.g.,
by promoting peripheral GABA release or rendering a pharmacological
GABA B1 agonist unable to cross the blood-brain barrier)
may be a valuable route for anti-pain therapy, circumventing
severe adverse effects associated with baclofen’s dominant
CNS activity.

Processing and modulation of painful signals have mainly
been attributed to higher-order CNS centers such as the dorsal
spinal cord and beyond. Our model may not only
inspire new approaches for developing TRPV1-centric pain therapeutics
but also offer a fresh look at modulation of sensory
input directly at the site of sensory transduction in nerve
terminals, a paradigm that may also be relevant for other somatosensory
modalities.

EXPERIMENTAL PROCEDURES 

Generation of Tag-Trpv1 BAC Transgenic Mice

The SF (Strep-Flag) tag was seamlessly integrated into a mouse BAC clone
(RP23-390G23) encoding the Trpv1 genomic locus, and transgenic mice
were obtained by pronuclear injection.

Behavioral Studies 

All animal experiments were performed in accordance with the regulations of the local governing bodies.
Thermal pain was assessed by measuring the response latency to a radiant
heat stimulus focused onto the plantar surface of the paw (Hargreaves assay).

Biochemical Protein Complex Purification and Mass Spectrometry 
Analysis 

Following plasma membrane fractionation of DRG, protein complexes were 
affinity isolated using Anti-Flag magnetic beads. Liquid chromatography-tan- 
dem mass spectrometry (LC-MS/MS) analysis was performed with in-solution 
digested affinity purified protein samples on a Q Exactive mass spectrometer 
(Thermo Scientific). Label-free quantitation (LFQ) was performed using 
MaxQuant Analysis Software. 

BRET and PLA Assays 

Luminescence and fluorescence signals of COS-1 cells transiently transfected 
with plasmids encoding Rluc BRET donor and YFP BRET acceptor fusion pro- 
teins were analyzed using an Infinite F500 microplate reader (Tecan). 

PLA was performed using Duolink reagents (Sigma) per manufacturer’s 
instructions. 

Calcium Imaging and Electrophysiological Recordings 

For calcium imaging experiments, primary sensory neurons or HEK293 cells 
were loaded with the calcium indicators Fura-2 or Cal-520 AM. Electrophysio- 
logical recordings were performed using a 700B amplifier and 1440A Analog 
Digital Converter (Molecular Devices) in whole-cell voltage clamp configuration. 

GABA Measurements 

GABA concentrations were determined by LC-MS/MS at Brainsonline 
(Groningen). 

Statistical Analyses 

Data are presented as mean ± SEM. Statistical significance was evaluated
employing Student’s t test for paired comparisons unless indicated otherwise.
p values < 0.05 were considered statistically significant, with *p < 0.05, **p <
0.01, and ***p < 0.001.
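
The reporting conventions above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `mean_sem` and `significance_stars` and the "n.s." label for non-significant results are hypothetical and not taken from the study, and the p value itself is assumed to come from a separate t test.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, SEM), where SEM = sample SD / sqrt(n)."""
    mean = statistics.fmean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


def significance_stars(p_value):
    """Map a p value to the star notation used in the figures.

    The "n.s." label for p >= 0.05 is an assumption of this sketch.
    """
    if p_value < 0.001:
        return "***"
    if p_value < 0.01:
        return "**"
    if p_value < 0.05:
        return "*"
    return "n.s."


if __name__ == "__main__":
    # Made-up withdrawal latencies (seconds), for illustration only.
    latencies = [1.0, 2.0, 3.0, 4.0]
    mean, sem = mean_sem(latencies)
    print(f"{mean:.2f} ± {sem:.2f}", significance_stars(0.03))
```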

Additional information is available in the Extended Experimental
Procedures. 

The heterogeneity of aneuploid cell
populations increases with stress,
causing resistance to emerge. An
eradication strategy involves sequential
applications of stress: the first stress
homogenizes the population via [...]
targets and eliminates the newly [...]

Highlights

• Higher stress leads to larger phenotypic variation in 
heterogeneous aneuploids 

• Eradication of aneuploids via dual-stress application: 
“evolutionary trap” (ET) 

• The first stress selects for a less diverse population; the 
second targets it 

• ET may be applicable toward azole resistance in Candida 
albicans and human cancer 

SUMMARY

Aneuploid genomes, characterized by unbalanced 
chromosome stoichiometry (karyotype), are associ- 
ated with cancer malignancy and drug resistance of 
pathogenic fungi. The phenotypic diversity resulting 
from karyotypic diversity endows the cell population 
with superior adaptability. We show here, using a 
combination of experimental data and a general sto- 
chastic model, that the degree of phenotypic varia- 
tion, thus evolvability, escalates with the degree of 
overall growth suppression. Such scaling likely ex- 
plains the challenge of treating aneuploidy diseases 
with a single stress-inducing agent. Instead, we 
propose the design of an “evolutionary trap” (ET) tar- 
geting both karyotypic diversity and fitness. This 
strategy entails a selective condition “channeling” 
a karyotypically divergent population into one with 
a predominant and predictably druggable karyotypic
feature. We provide a proof-of-principle case in 
budding yeast and demonstrate the potential effi- 
cacy of this strategy toward aneuploidy-based azole 
resistance in Candida albicans. By analyzing existing 
pharmacogenomics data, we propose the potential 
design of an ET against glioblastoma. 



INTRODUCTION 

Germline evolution shapes the organismal tree in changing envi- 
ronments during the long course of natural history, whereas 
acute environmental fluctuation drives asexual cellular evolution, 
often associated with dynamic structural changes in the ge- 
nomes of microbes or cancer cells (Lewontin, 1970; Merlo 
et al., 2006). Aneuploidy (chromosome copy-number imbalance) 
is a type of genome alteration widely observed during cellular 
evolution of eukaryotic species, such as laboratory (Chen 
et al., 2012a; Hughes et al., 2000; Rancati et al., 2008), industrial 
(Borneman et al., 2011; Chen et al., 2012b; Infante et al., 2003; 
Kvitek et al., 2008), and pathogenic (Marichal et al., 1997; Ni
et al., 2013; Selmecki et al., 2006; Sionov et al., 2010) yeasts,
as well as protozoan parasites such as leishmania (Leprohon
et al., 2009; Mannaert et al., 2012; Ubeda et al., 2008) and trypanosomes
(Llewellyn et al., 2011; Minning et al., 2011).

Emerging evidence also points to aneuploidy as an important 
driver for the evolution of human cancer (Davoli et al., 2013; 
Holland and Cleveland, 2009; Jones et al., 2010; Ng et al., 
2010; Wang et al., 2014). Due to its impact on the expression 
of many genes, aneuploidy brings about large phenotypic 
changes that can be either detrimental or beneficial in a karyo- 
type- and environmental condition-dependent manner (Chen 
et al., 2012b; Pavelka et al., 2010). 

Aneuploid populations are often characterized by heterogeneity,
that is, the coexistence of many different karyotypes. The genetic
diversity provides the raw material for evolutionary selection 
and endows the aneuploid population with high adaptive poten- 
tial (Burrell et al., 2013; Chen et al., 2012a; Maley et al., 2006). 
This underscores the exceptional challenge of treating dis- 
ease-causing cell populations characterized by large karyotype 
heterogeneity and instability (Gerlinger et al., 2012; Harrison 
et al., 2014; Lee et al., 2011; Navin et al., 2011; Sotillo et al.,
2010). One idea is to find drugs strongly exacerbating a common
deficiency of aneuploids irrespective of specific karyotype (Oromendia
and Amon, 2014). A study found that among a collection
of disomic yeast strains, many showed prominent growth de- 
fects toward agents that perturb proteome homeostasis, such 
as hygromycin B (a translation inhibitor) or geldanamycin (an 
Hsp90 inhibitor) (Torres et al., 2007), supporting the notion that 
a common deficiency of aneuploidy is overloading of the prote- 
ome quality-control system. However, some of the disomic 
strains exhibited resistance, as opposed to sensitivity, toward 
proteotoxic agents, and one of the karyotypic features, 
increased copy number of chromosome XV (chrXV), emerged 
as the adaptive variant when a diploid strain was evolved in 
the presence of an Hsp90 inhibitor (Chen et al., 2012a). In addi- 
tion, when a highly heterogeneous aneuploid population was 
treated with drugs that imposed strong immediate growth inhibi- 
tion, long-term culturing enabled evolutionary selection for a few 
variants that eventually rendered the population drug resistant 
(Chen et al., 2012a) (also see results of this study). These findings
highlight the fallacy of short-term efficacy in drug treatments
when dealing with heterogeneous populations already poised 
for rapid adaptation. 

In this study, we aim for an innovative approach that accounts 
for the evolutionary dynamics and achieves long-term growth 
suppression or extinction of aneuploid cell populations consist- 
ing of a wide spectrum of karyotypes. Our analyses demonstrate 
on a general level that the evolutionary potential of a heteroge- 
neous aneuploid population escalates under increasing stress, 
but such a population may become highly targetable once the 
karyotype diversity is drastically confined. These findings led 
us to design a two-component strategy for treating diseases 
associated with aneuploid cell populations that targets the pop- 
ulation’s adaptability and fitness. 

RESULTS 

Phenotypic Variation Scales with Growth Suppression 
under Diverse Stress Conditions in Budding Yeast 

To investigate whether certain stress conditions may be consistently
effective toward aneuploids with a wide spectrum of karyotypes,
we performed a re-analysis of the growth data from a
previous study subjecting a panel of 38 aneuploid yeast strains 
(S. cerevisiae) with diverse and random chromosome stoichiom- 
etry to phenotypic profiling across a wide range of conditions 
with varying stress types and levels (Pavelka et al., 2010) (Figure 1A).
This analysis revealed a surprising trend: those conditions
more toxic to the aneuploid cohort (lower mean growth
ratio) also produced a larger fitness spread among the aneuploids,
such that even under highly toxic conditions, although
many aneuploids failed to grow, some aneuploids endured or
thrived. The standard deviation (spread) and mean of the growth
ratio (overall growth suppression) of diverse aneuploids are
strongly correlated across diverse stress conditions (Figure 1B).
This same trend could also be observed with the set of disomic 
aneuploid strains isolated in a previous study (Torres et al., 
2007) treated with increasing concentrations of different drugs 
(Figure 1C). 
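
The quantities behind this trend can be illustrated with a minimal Python sketch. It computes, for each condition, the overall growth suppression (negative mean of the log2 growth ratio) and the spread (SD), then correlates the two across conditions. The growth values and condition names below are made up for illustration and are not data from the study; the helper names `condition_summary` and `pearson_r` are likewise hypothetical.

```python
import math
import statistics


def condition_summary(log2_ratios):
    """Return (-mean, SD) of log2(aneuploid growth / euploid growth) for one condition."""
    return -statistics.fmean(log2_ratios), statistics.stdev(log2_ratios)


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Made-up log2 growth ratios of four strains under three hypothetical conditions.
growth = {
    "mild": [-0.1, 0.0, -0.2, 0.1],
    "moderate": [-0.8, -0.2, -1.0, 0.2],
    "severe": [-3.0, -0.5, -2.5, 0.4],
}

summaries = {name: condition_summary(values) for name, values in growth.items()}
suppression = [s[0] for s in summaries.values()]
spread = [s[1] for s in summaries.values()]
r = pearson_r(suppression, spread)

if __name__ == "__main__":
    for name, (supp, sd) in summaries.items():
        print(f"{name}: suppression={supp:.3f}, SD={sd:.3f}")
    print(f"Pearson r = {r:.3f}")
```

In this toy input, the condition with the strongest mean suppression also contains one strain that grows better than the euploid, which is the pattern the analysis describes.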

We used a stochastic model to investigate the theoretical
generality of the above experimental observation (Figures 1D,
1E, S1A, and S1B; see Extended Experimental Procedures for
a detailed description of assumptions and simulations). Briefly,
we assumed that fitness is governed by N independent pathways,
reflecting the modular architecture of cellular systems.
We further assumed that, for each individual pathway, the
pathway activity across diverse karyotypes follows a simple
normal distribution. Under the stress-free condition, the optimal
fitness is reached by the euploid at the peak of the N-dimensional
normal distribution (Figure 1D; Extended Experimental
Procedures), whereas aneuploid populations are located at positions
away from the peak, reflecting suboptimal fitness. In the
presence of stress, however, the optimal fitness point moves
away from the euploid position (Figure 1E). The change in the distance
between the position of each karyotype and the optimal
fitness point represents the fitness change caused by stress.
Simulation of this model with varying values of N revealed two
properties of the system remarkably consistent with experimental
observations (Figures 1F and 1G). First, under stress conditions
with sufficient magnitude, most aneuploids had lower
fitness than the euploid, yet there was always a fraction of aneuploids
that assumed higher fitness than the euploid. Second, the
absolute value of the mean and standard deviation of the relative
fitness satisfied a positive linear correlation across diverse stress
conditions and a range of N values (Figures 1F, 1G, and S1C-S1F).
The fact that such a simple model could qualitatively capture
the experimental data suggests the observed phenomenon
to be a general property of heterogeneous aneuploid populations
irrespective of specific stress applied.
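
A toy simulation in the spirit of this model is sketched below in Python. It is not the authors' implementation, whose details are in the Extended Experimental Procedures. The assumptions here are: each karyotype is a point with N coordinates drawn from a standard normal distribution, the euploid sits at the origin, stress shifts the optimum along the first axis, and the fitness change is the negative of the change in Euclidean distance to the optimum. All function and parameter names are hypothetical.

```python
import math
import random
import statistics


def simulate_fitness_change(n_pathways, stress_magnitude, n_karyotypes=2000, seed=0):
    """Return the stress-induced fitness change of each simulated aneuploid.

    Assumptions of this sketch (not taken from the paper):
    - each karyotype is a point whose N coordinates are drawn from N(0, 1);
    - the euploid sits at the origin, the stress-free optimum;
    - stress moves the optimum along the first axis by `stress_magnitude`;
    - fitness change = -(distance to new optimum - distance to old optimum).
    """
    rng = random.Random(seed)
    origin = [0.0] * n_pathways
    optimum = [stress_magnitude] + [0.0] * (n_pathways - 1)
    changes = []
    for _ in range(n_karyotypes):
        position = [rng.gauss(0.0, 1.0) for _ in range(n_pathways)]
        before = math.dist(position, origin)
        after = math.dist(position, optimum)
        changes.append(-(after - before))
    return changes


def summarize(changes):
    """Return (mean suppression, SD, fraction of karyotypes that gain fitness)."""
    suppression = -statistics.fmean(changes)
    spread = statistics.stdev(changes)
    gainers = sum(1 for c in changes if c > 0) / len(changes)
    return suppression, spread, gainers


if __name__ == "__main__":
    for magnitude in (0.5, 1.0, 2.0):
        supp, sd, frac = summarize(simulate_fitness_change(5, magnitude))
        print(f"stress={magnitude}: suppression={supp:.3f}, SD={sd:.3f}, gainers={frac:.3f}")
```

Under these assumptions, a stronger shift of the optimum gives both a larger mean suppression and a larger spread, while a minority of karyotypes still moves closer to the new optimum.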

Therapeutic Compounds Elicit Phenotypic 
Heterogeneity in Human Tumor Cell Lines 

A vast majority of solid tumors are aneuploid: 91.7% of 817 solid
tumor cell lines surveyed by Cancer Cell Line Encyclopedia
(CCLE) have at least 1 chromosome arm-level copy-number
variation, whereas 58.9% have more than 10 (see Extended
Experimental Procedures). Karyotypic heterogeneity has also
been observed within a single tumor (Gerlinger et al., 2012; Navin
et al., 2011). The relapse fueled by adaptive changes in a tumor
remains a major challenge in cancer treatment. A logical exten- 
sion of our findings is that, in karyotypically heterogeneous can- 
cer cell populations, the treatment suppressing the overall 
growth may also escalate tumor cells’ phenotypic heterogeneity 
and potentially adaptability. To examine this possibility, we first 
investigated a dataset consisting of 54 karyotypically divergent 
human breast cancer cell lines and 77 different anticancer drugs 
across 10 concentrations (SU2C breast cancer project; Heiser 
et al., 2012). As shown in Figures 2A and 2B, in drug doses 
that caused prominent (>50%) overall growth suppression, the 
spread of growth rates among cell lines treated with most chem- 
icals (~95%) also increased. A few drugs showed more consis- 
tent toxicity, but due to the lack of a normal euploid control, it is 
unknown whether this was specific to aneuploid cancer cells. 
Examination of the correlation between overall growth suppres- 
sion (-mean) and variation (SD of growth rates) found a consis- 
tent positive correlation. Simulation of our multi-dimensional 
fitness model with varying N values produced a similar correla- 
tion pattern (Figures 2C and 2D). Cell lines from other solid 
tumors (e.g., central nervous system, skin, and lung; Barretina 
et al., 2012) also demonstrated a similar trend (Figure S2). 

Although model simulations showed that for a given stress (X), 
a group of aneuploids may be selected for superior fitness, they 
also predict that if a second stress (Y) is applied to shift the 
optimal fitness in the direction opposite from stress X, those 
selected aneuploids would fall into the death zone in the 
fitness landscape (Figures 3A and 3B). This implies that if a 
highly evolvable population with heterogeneous random karyo- 
types can be “channeled” toward a certain karyotypic charac- 
teristic under a designed selection (stress X), thus drastically 
“shrinking” the population’s evolvability, a second treatment 
(stress Y) may be added to eradicate the selected singular or 
limited karyotypes. Together these two treatments would form 
an “evolutionary trap” (ET) on the elusive aneuploid population. 
Supporting this idea, for a given specific aneuploid yeast strain, 
in most cases one or multiple conditions could be found that 
caused >80% growth inhibition compared to the euploid control 
(Figure S1G). 
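The logic of the evolutionary trap can be illustrated with a toy simulation. The sketch below is not the model used in this study: it assumes that aneuploids are random Gaussian displacements from the euploid in an N-dimensional activity space, that log fitness falls off with the squared distance from the stress-specific optimum, and that an "unrelated" stress is one orthogonal to stress X. All names and parameter values are hypothetical, apart from the 1,000 simulated cells and the top 5% selected by stress X.

```python
import random


def log_fitness(cell, optimum):
    """Assumed fitness: decreases with squared distance from the optimum."""
    return -sum((c - o) ** 2 for c, o in zip(cell, optimum))


def simulate_trap(n_dims=8, n_cells=1000, shift=3.0, top_fraction=0.05, seed=0):
    rng = random.Random(seed)
    # The euploid sits at the origin; each aneuploid is a random displacement.
    cells = [[rng.gauss(0.0, 1.0) for _ in range(n_dims)] for _ in range(n_cells)]
    stress_x = [shift] + [0.0] * (n_dims - 1)        # selection ("channeling")
    stress_y = [-shift] + [0.0] * (n_dims - 1)       # opposite direction to X
    stress_z = [0.0, shift] + [0.0] * (n_dims - 2)   # unrelated direction
    # Channeling: keep only the aneuploids best adapted to stress X.
    ranked = sorted(cells, key=lambda c: log_fitness(c, stress_x), reverse=True)
    survivors = ranked[: int(n_cells * top_fraction)]

    def mean_fitness(optimum):
        return sum(log_fitness(c, optimum) for c in survivors) / len(survivors)

    return {
        "X": mean_fitness(stress_x),
        "Y_opposite": mean_fitness(stress_y),
        "Z_unrelated": mean_fitness(stress_z),
    }


result = simulate_trap()
print(result)
```

Under these assumptions the survivors of stress X fare worse under the opposing stress Y than under the unrelated stress Z, which is the qualitative behavior the trap relies on.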

Figure 1. Scaling of Phenotypic Variation
with Growth Suppression in Aneuploids un- 
der Diverse Stress Conditions 

(A) The growth of 38 aneuploid strains relative to
the euploid, as log2 ratio of aneuploid growth (OD
increase) over the euploid with the nearest ploidy
(see Extended Experimental Procedures), is binned
by growth conditions. Each point in a boxplot
represents an aneuploid strain. The half-length of
each box represents the SD of relative growth
among aneuploids (σ), and the middle line represents
the average (μ). Note that the horizontal
dashed line across 0 represents the euploid
control.

(B) Phenotypic variation among the aneuploids,
measured as SD of relative growth (σ), scales with
average growth defect of the aneuploid cohort
across diverse stress conditions (-μ).

(C) The growth of 12 disomy strains relative to the 
haploid control under increasing concentrations of 
hygromycin B (Hygro), amphotericin B (Amph), or 
fluconazole (FL). Boxplot representation is as 
described for (A). 

(D and E) Schematic representation of the model is 
shown for the simple case of N = 2 with axes as 
labeled. Deep blue to deep red code for increasing 
fitness. (D) Graph represents the stress-free con- 
dition, where the euploid that is located at the 
center of the activity field (position m) assumes the 
highest fitness. (E) Graph represents a stress con- 
dition, where the optimal fitness point shifts from 
m to m_{k,l}, reflecting the activity change necessary
for adaptation. Consequently, the euploid (located 
at point m) no longer holds maximal fitness, 
whereas higher fitness is assumed by certain 
aneuploids (those occupying redder regions). 

(F) Example simulation results of the model for 50
random aneuploids under diverse stress conditions
(governed by varying type k and magnitude l)
for a 24-dimension space (N = 24), with relative
growth displayed as the experimental data in (A).
The red line shows average log2 growth ratio from
the simulated aneuploid population. Note the 
appearance of adaptive aneuploids under high- 
stress conditions (toward the right of the graph). 

(G) Simulations of the model with a wide range of
N values demonstrate the positive correlation
between σ and -μ in various numbers of dimensions.
The simulated correlations are shown
in colored lines, whereas the experimental data
are overlaid in gray.

See also Figure S1.



Finding Drugs Forming an ET against Heterogeneous 
Aneuploid Budding Yeast 

To investigate the validity of ET against heterogeneous aneu- 
ploids, we first constructed a yeast population with high-degree 
karyotypic diversity by sporulation of a homozygous pentaploid 
yeast strain. Twenty-eight viable aneuploid meiotic products 
with ploidy above 1.9N were mixed with isogenic diploid, triploid,
and tetraploid cells to mimic karyotype heterogeneity observed 
in pathogenic fungi (Harrison et al., 2014) or human cancers (Mi- 
telman et al., 2012). Because most aneuploid karyotypes are un- 
stable (St Charles et al., 2010; Pavelka et al., 2010; Zhu et al., 



2012), the degree of heterogeneity of this mixed population (het- 
erogeneous mix) is expected to be far greater, as confirmed by a 
broad DNA content profile that did not show bias toward any 
specific chromosome gain or loss (Figure S3). To find a selective 
condition for karyotype channeling, we took advantage of the 
previous finding that growth in high concentrations of radicicol, 
an inhibitor of Hsp90 chaperone, selects for extra chromosome 
(chr) XV (Chen et al., 2012a). Treatment of three independent 
populations of the heterogeneous mix with 50 μg/ml radicicol
all predictably led to convergent evolution toward chrXV gain 
(Figures S3C-S3E). Six individual colonies derived from a culture 

Figure 2. Phenotypic Heterogeneity of Human Cancer Cell Lines Positively Correlates with Increasing Growth Suppression by Therapeutic
Compounds 

The relationship between phenotypic variation (SD of growth rates) and average growth suppression was examined using published drug-response profiling data 
from 54 breast cancer cell lines treated with 77 different potential therapeutic compounds over 10 different concentrations (see Extended Experimental 
Procedures). 

(A) Histograms showing average growth suppression under different drug concentrations. 

(B) The SD of growth rates caused by each drug under the range of concentrations tested are shown as a heatmap. Note that drug concentrations for (A) and (B) 
are aligned, showing the general trend of increasing SD with increasing growth suppression. The clustering is based on Euclidean distances. 

that had grown to saturation in radicicol-containing medium
were found to vary in karyotypes but all shared chrXV gain, con- 
firming that radicicol caused a selective sweep in the heteroge- 
neous mix (Figure S3F). 

We next performed a screen for chemicals particularly effective
against chrXV trisomy and identified hygromycin B, a translation
inhibitor (Singh et al., 1979), as the most potent inhibitor of
chrXV trisomy relative to its effect on the diploid control (Fig- 
ure S4). We further confirmed potent growth inhibition by 
hygromycin B for all six different karyotypes identified from rad- 
icicol-selected cultures sharing chrXV gain (Figure 3C). We note 
that even though hygromycin B effectively suppressed the 
growth of aneuploids that had gained chrXV, this effect did not 
extend to all aneuploids: aneuploid strains with increased chrII
or IX dosage but having a normal dosage of chrXV were associ- 
ated with superior resistance to hygromycin B compared to the 
euploid control (Figures 3D and 3E). However, strains with com- 
binations of chrXV, II, and IX gains were highly sensitive rather 
than resistant to hygromycin B (Figure 3E). 

We showed previously that dosage increases of STI1 and PDR5,
two genes present on chrXV, are “driver mutations” underlying
chrXV gain-associated radicicol resistance (Chen et al., 2012a). 
However, hygromycin B sensitivity was unrelated to dosage alter- 
ations of these two genes (Figures 4A and 4B). To identify genes 
on chrXV whose increased copy number could cause hypersen- 
sitivity to hygromycin B, we screened each of the 453 genes 
located on chrXV with its original promoter and carried on a 
low-copy (centromeric) plasmid (Ho et al., 2009) and identified 5 
genes (CRS5, RPS15, TRM11, RRP6, SER1) (Figures 4C, S5A,
and S5B). Integration of one copy of each of the top three hits (CRS5,
RPS15, TRM11) recapitulated up to 50% of the hygromycin B 
sensitivity of chrXV trisomy in a diploid background (Figure 4E). 
However, the effects of these genes were not additive (data not 
shown), suggesting that the hygromycin B sensitivity associated 
with chrXV gain is an emergent property of gaining many genes 
carried on this chromosome beyond the three found in the screen. 
Consistent with this notion, deletion of a copy of each of the three 
hits (CRS5, RPS15, TRM11) individually did not change hygromycin
hypersensitivity of chrXV trisomy (Figure 4F).

Hygromycin B and other proteotoxic agents were previously 
proposed to enhance the protein quality-control deficit due to 
increased expression of a large number of genes (Oromendia 
et al., 2012; Torres et al., 2007). However, the genes whose 
dosage increase gave rise to hygromycin B sensitivity were not 
all highly expressed genes compared to the other genes located 
on chrXV in either euploid or aneuploid with chrXV gain (p = 0.56 
for transcripts, p = 0.43 for proteins, Mann-Whitney U test) (Fig- 
ures 4C, 4D, and S5C). RPS15, the only highly expressed gene 
among the hits, encodes one of 30 proteins within the small ribo- 
somal subunit to which hygromycin B binds (Borovinskaya et al., 
2008), suggesting that a direct gene-specific cause underlies 



some of the drug sensitivity. Taken together, different molecular 
mechanisms are likely to account for radicicol resistance and hy- 
gromycin hypersensitivity of aneuploid strains with chrXV gain. 

Evolution Dynamics of Cell Populations under Single- 
Drug Treatment versus ET 

Even though hygromycin B alone potently suppressed the 
growth of the chrXV trisomy strain initially, the treated cell popu- 
lation eventually adapted, as indicated by growth takeoff after 
~50 hr (Figure 5A). Karyotyping of the adapted chrXV trisomy cul- 
ture revealed that the population was a mixture of euploid and 
aneuploid cells that no longer had chrXV gain, but some now car- 
ried an additional copy of chrIX (Figures 5B and 5C), consistent 
with the result shown in Figures 3D and 3E. This observation re- 
confirmed the requirement for chrXV gain in hygromycin B sensi- 
tivity and demonstrated the ability of the population to escape 
single-drug treatment by continued karyotype change. As expected,
the condition that selected for chrXV, i.e., 50 μg/ml radicicol,
was highly toxic to all those survivors that had become
hygromycin B resistant because they had lost the gained chrXV 
(Figures 5D and 3E), supporting the rationale of combinatorial 
treatment with both radicicol and hygromycin B. 

Indeed, the combination of both drugs led to the extinction of 
all three independent cultures first subjected to the radicicol 
selection (Figures 5F-5H). The same drug pair was similarly 
effective against the heterogeneous aneuploid population (het- 
erogeneous mix) when added simultaneously (Figures S6A- 
S6E). When the heterogeneous mix was treated with a single 
chemical, 10 out of 21 tested chemicals (for example, 
0.08 μg/ml menadione and 100 μg/ml radicicol) imposed stronger
growth suppression than 50 μg/ml hygromycin B (Figure S6C).
Hygromycin B also ranked low in suppressing euploid
growth (Figure S6A). Yet, when combined with 50 μg/ml radicicol,
hygromycin B, but not menadione or radicicol, led to the
extinction of the heterogeneous mix (Figure S6D). Thus, the 
opposing selective effects of these two drugs on the channeled 
karyotype (chrXV gain) imposed an adaptive dilemma for the het- 
erogeneous population, leading to its extinction. A potential 
pitfall of this ET drug pair is that the diploid with an additional 
copy of STI1 showed an enhanced level of resistance against 
50 μg/ml radicicol yet without causing hypersensitivity to
50 μg/ml hygromycin (Figure 4A). Such effects due to potential
single-gene gain could be remedied by increasing the concentration
(from 50 to 100 μg/ml) of radicicol (Figures S6F and S6G).

An Azole-Based ET for Human Pathogen Candida 
albicans 

We tested the possibility of using the ET strategy for more effec- 
tive anti-fungal treatment. It was previously shown that a 
mechanism for human pathogen Candida albicans to confer 
resistance to fluconazole, a first-line medicine treating invasive 



(C) At three concentrations that considerably reduced the overall growth rate (>50% decrease compared to no drug control in A), the general correlation between 
μ and σ across different growth conditions is examined. The linear regression line (red) is surrounded by 95% confidence fitting intervals (darker gray
area). Note similar fitting parameters across different drug doses. 

(D) The correlation between SD and average growth suppression is also recapitulated by simulations of the multi-dimensional model with the number of pathways 
(N) in the range of 48-96. The modeled fitting is shown in colored lines, whereas the published experimental data are shown in gray. 

See also Figure S2. 
candidiasis in immune-compromised patients, is gain of isochromosome
5L (i(5L), which contains two copies of the left arm of
chr5) (Selmecki et al., 2006). This aneuploid feature was also 
recapitulated in a laboratory evolution experiment selecting for 
fluconazole-resistant C. albicans (Selmecki et al., 2009). Thus, 
fluconazole may serve as the “selection” drug, which would be 
used in combination with a second drug specifically targeting 
i(5L) gain. To this end, we screened a chemical library that con- 
tained 1,713 FDA- or other regulatory agency-approved drugs, 
580 natural compounds, and 420 other bioactive agents against 
an i(5L)-containing C. albicans isolate from a 30-year-old male 
who developed fluconazole resistance (Marr et al., 1997; Sel- 
mecki et al., 2006). We looked for chemicals that showed 
much elevated potency against the i(5L)-containing strain 
compared to the euploid control generated by spontaneous 
i(5L) loss of the original i(5L)-containing strain (Selmecki et al., 
2008). Such drugs were likely to be missed in previous screens 
against euploid C. albicans (Okoli et al., 2009; Spitzer et al., 
2011).

A primary screen found that 100 out of 2,713 compounds (Figure S7A)
caused at least 80% growth suppression to either the
i(5L) strain or the euploid, or both. For this set of compounds, 
we then determined the concentration causing 80% growth suppression
(IC80). Under 26 and 11 conditions, respectively, the i(5L)
strain showed a significantly (Z test, p < 0.05) higher (i.e., more
resistant) or lower (more susceptible) IC80 than
the euploid (Figure 6A). As expected, the i(5L) strain exhibited 
increased resistance toward a panel of seven other azole deriv- 
atives. By contrast, pyrvinium pamoate (PP), a medication for 
pediatric pinworm infection, strongly suppressed the growth of 
the i(5L) strain but was ineffective against the euploid even at 
concentrations as high as 80 μM after 48 hr culture on
agar (Figures 6A, 6B, and S7B). Given that euploid and i(5L)- 
gained C. albicans could co-exist in patients treated with flucon- 
azole (Selmecki et al., 2008), we tested the combinatorial effect 
of PP with fluconazole in a 1:1 mixed population of both karyo- 
types. At concentrations of PP that suppressed the growth of 
the i(5L) strain, the resistance to fluconazole regressed from 
over 256 μg/ml back to the same level exhibited by the euploid
without PP (Figures 6C and 6D), supporting the ET rationale for 
combinatorial anti-fungal treatment. 
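As a rough illustration of how an IC80 can be read off a dose-response series, the sketch below interpolates on a log-dose scale between the two tested concentrations that bracket 80% growth suppression. This is a deliberate simplification and an assumption of this example: the IC80 values reported here come from model-based curve fitting (drc package in R; Figure 6A), and the doses and growth fractions shown are hypothetical.

```python
import math


def ic_by_interpolation(doses, growth_fraction, inhibition=0.80):
    """Estimate the dose giving the requested inhibition.

    doses: ascending drug concentrations (all > 0).
    growth_fraction: growth relative to the no-drug control at each dose.
    Returns None when no adjacent pair of doses brackets the target.
    """
    target = 1.0 - inhibition
    pairs = list(zip(doses, growth_fraction))
    for (d_lo, g_lo), (d_hi, g_hi) in zip(pairs, pairs[1:]):
        if g_lo > target >= g_hi:
            frac = (g_lo - target) / (g_lo - g_hi)
            log_ic = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_ic
    return None


# Hypothetical dose series (µM) for a susceptible and a resistant strain.
doses = [1.0, 10.0, 100.0]
ic80_susceptible = ic_by_interpolation(doses, [0.9, 0.5, 0.1])
ic80_resistant = ic_by_interpolation(doses, [1.0, 0.9, 0.8])
print(ic80_susceptible, ic80_resistant)
```

A strain whose growth never drops to 20% of the control within the tested range yields None, which corresponds to an IC80 above the highest concentration tested.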



A Potential ET against EGFR-Driven Tumors such as 
Glioblastoma 

In many human cancers, epidermal growth factor receptor 
(EGFR) signaling is a major contributing factor to aggressive phenotypes
(Wheeler et al., 2010). The human EGFR gene is located
on the short arm of chr7 (chr7p), and chr7p gain is a prevalent
karyotypic feature across diverse types of human cancers (Beroukhim
et al., 2010). In particular, chr7 gain was observed in 80%
of glioblastoma samples collected from The Cancer Genome 
Atlas Pilot Project (TCGA) cohort of 219 patients (Bredel et al., 
2009; Network, 2008). Within individual tumors, arm-level (mainly 
chr7p) or whole-chromosome gain of chr7p was detected in 
nearly all surgical sections examined, suggesting that this event, 
unlike other heterogeneous genetic alterations, is an early critical 
event in clonal expansion (Sottoriva et al., 2013). In addition to 
being a likely product of the selection for EGFR-dependent ma- 
lignant transformation or adaptation, chr7p polysomy was also 
reported in 80% of tumor samples from lung cancer patients 
that acquired resistance to gefitinib or erlotinib, EGFR kinase in- 
hibitors (Bean et al., 2007). Thus, in designing an ET against 
EGFR signaling-dependent tumors, gefitinib or erlotinib may be 
used as the channeling drug that could impose a pharmacolog- 
ical selection for chr7p gain beyond the selection during tumor- 
igenesis. To find the pairing drug that may be particularly toxic 
toward chr7p gain, we analyzed the pharmacogenomics data 
(Barretina et al., 2012) by correlating the dosage of chr7p of 29 
central nervous system (CNS) tumor cell lines with the response 
(IC50) to 23 therapeutic compounds (Figure 7A). This analysis
identified a significant correlation between increased chr7p 
dosage and increased sensitivity to irinotecan, a drug that is 
FDA approved for treating colon cancer but not yet for glioblas- 
toma (Figure 7B). This result suggests that gefitinib or erlotinib 
together with irinotecan may form an ET against EGFR-driven tu- 
mors, especially glioblastoma where chr7p is already highly en- 
riched even without drug treatment (Figure 7E). 
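The dosage-response relationship examined above can be sketched as a simple linear fit of drug response against chr7p dosage, using only the cell lines for which an IC50 is available. The values below are hypothetical, and the use of log10-transformed IC50 and ordinary least squares are assumptions of this example rather than details taken from the analysis.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


# Hypothetical (chr7p copy number, IC50 in µM); None marks unavailable data.
cell_lines = [(2, 8.0), (2, None), (3, 4.0), (3, 5.0), (4, 2.0), (4, None), (5, 1.0)]

# Cell lines without an IC50 are excluded before fitting.
usable = [(dosage, math.log10(ic50)) for dosage, ic50 in cell_lines if ic50 is not None]
slope, intercept = fit_line([d for d, _ in usable], [y for _, y in usable])
print(f"{len(usable)} cell lines, slope = {slope:.2f} log10(IC50) per chr7p copy")
```

A negative slope means that higher chr7p dosage goes with a lower IC50, that is, with greater drug sensitivity.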

DISCUSSION 

The analyses presented above started from an exploration of the 
general response of karyotypically heterogeneous cell popula- 
tions to diverse stress conditions and proceeded to the design 



Figure 3. Design and Experimental Implementation of ET in Budding Yeast 

(A and B) Model simulations predict that cell population adapted to a specific stress (X) through karyotype channeling can be highly targetable by a stress Y that 
shifts optimal fitness in the direction opposite to X, but not by a second stress in a random direction. (A) Schematic representations of the fitness landscape in a 
simplified 2D example similar to that in Figures 1D and 1E but projected to the plane defined by pathway activities. (B) Results of model simulations in a high-dimensional
fitness space under conditions indicated in (A). Note that only the aneuploids selected by Stress X (the top 5% adaptive ones) are shown. A total of
1,000 cells were simulated. Each dot represents the relative fitness of an aneuploid cell compared to the euploid. Death zone was defined as having negative
growth value. 

(C) Six independent colonies isolated from radicicol-adapted population with gain of chrXV (as shown in Figure S3F) were grown under indicated conditions 
until saturation was reached in the fastest growing strain. Histograms show average amount of growth normalized to euploid and SEM derived from four
replicates. 

(D) chrII and chrIX disomy strains generated previously by genetic manipulation (Torres et al., 2007) exhibit resistance to hygromycin B yet are sensitive to
radicicol. 

(E) Aneuploid strains generated by random triploid meiotic segregation with indicated karyotypic features were cultured in different concentrations of hygromycin 
B. Boxplots show growth relative to the euploid control with each dot representing an aneuploid strain. Karyotypes are categorized by their states of chrII/IX/XV
dosage, but other chromosome aneuploidy may also be present in these strains. The amount of growth (OD increase) was normalized to the euploid with the 
nearest ploidy. The dashed line represents the average of normalized controls. ** indicates p < 0.01 according to Mann-Whitney U test. 

See also Figures S3 and S4. 
Figure 4. Different Sets of Genes on chrXV Cause Radicicol Resistance or Hygromycin B Sensitivity when Increased in Copy Number

(A and B) Copy-number gain (+, by genomic integration) or loss (-, by genomic deletion) assays showing that increased copy numbers of STI1 and PDR5, which 
are both critical for radicicol resistance, do not contribute to the hygromycin B hypersensitivity. Relative growth compared to the diploid control is reported in 
bar plots with the SEM derived from three replicates. Asterisks denote significant difference from the corresponding control (the diploid or chrXV trisomy [Tri-XV]) 
(*p < 0.05; **p < 0.01; two-tailed t test).

of a strategy to eradicate such cell populations that accounts for
their adaptive potential. Growth profiling of yeast aneuploids 
demonstrated that the phenotypic variation resulting from karyo- 
type diversity scales directly with the degree of average inhibition 
of population growth under diverse stress conditions. Mathe- 
matical simulations based upon a few simple assumptions 
support the generality of our experimental observation. A more 
intuitive understanding of this phenomenon may lie in the highly 
pleiotropic nature of aneuploidy. The finding from yeast aneu- 
ploids was also recapitulated in human cancer cell lines derived 
from highly aneuploid solid tumors: the growth-suppressive 
effects of a majority of therapeutic compounds also significantly 
correlated with phenotypic heterogeneity in a positive manner. 
This correlation predicts that a heterogeneous aneuploid cancer 
population may harbor large phenotypic variation under stress, 
fueling rapid adaptive evolution. Frequent association between 
genetic heterogeneity and poor prognosis has indeed been 
observed in cancer clinics (Maley et al., 2006; McGranahan 
et al., 2012). Chromosomal instability (CIN) in mice also pro- 
motes tumor development (Sotillo et al., 2007; Weaver et al., 
2007), although the extent to which genetic heterogeneity con- 
tributes to tumorigenesis in common mouse cancer models 
remains to be further explored. Beyond aneuploidy, the wide 
spectrum and large number of genetic modifications in a given 
cancer genome are likely to augment the pleiotropic effects on 
cellular pathways caused by aneuploidy. 

Our experimental and theoretical findings argue on a general 
level that the difficulty of suppressing karyotypically heteroge- 
neous cell populations is rooted in the large adaptive potential 
in the presence of severe stress. However, our findings do not 
rule out the possible existence of agents with broad inhibitory 
effects against aneuploids, especially when the karyotypic 
space is limited. The principle of ET in fact capitalizes on this 
notion by reducing karyotypic heterogeneity to an exceedingly 
narrow space through an evolutionary process. In our proof- 
of-principle experiments in budding yeast, this confined karyo- 
typic space is simply gain of chrXV, which harbors the “driver 
mutations” for radicicol resistance (selection), whereas multi- 
ple “passenger” genes on chrXV confer enhanced sensitivity 
to hygromycin B, the agent for extinction, when increased in 
copy number (Figures 7C and 7D). The complex contribution
of gene dosage to hygromycin B sensitivity and the requirement 
for multiple genes to achieve radicicol resistance predicted 
that an efficient escape from the ET is not easily attainable 
through single-gene mutations or copy-number changes. 
Although ET is not presumed to be an omnipotent solution for 
cancer or fungal infections, we argue that, by accounting for 



the evolutionary trajectory of a cell population, ET may substan- 
tially reduce the risk of evolved drug resistance and disease 
relapse. 

The principle of ET is distinct from the idea of targeting a sin- 
gle characteristic trait of aneuploids. We note that both drugs 
used in the yeast ET, radicicol and hygromycin B, can perturb 
proteome homeostasis (Oromendia et al., 2012; Singh et al., 
1979; Taipale et al., 2010; Torres et al., 2007), but the responses 
of aneuploids to these drugs are karyotype specific rather than 
uniform. The rationale behind the dual drug treatments associ- 
ated with ET anticipates evolutionary changes and is thus 
fundamentally different from the idea of synthetic lethality (Kae- 
lin, 2005; Luo et al., 2009) or the use of dual drugs to target 
different aspects of a common defect of aneuploidy (Tang 
et al., 2011). ET drugs are not chosen to interfere with redundant 
processes but rather collaborate to force and intercept a pre- 
dicted evolutionary trajectory of the genome. The same karyo- 
type feature is selected by one drug but rejected by the other, 
as a result of linkage of two pools of genes, which respond 
oppositely to the two different drugs, on the same chromosome 
(Figure 7C).

Our experiments with Candida provide a possible ET example 
where PP, which selects against the gain of i(5L), could enhance 
the efficacy of fluconazole, the prior selection favoring i(5L) 
gain. PP is extremely well tolerated in the pediatric population 
for gastrointestinal treatment due to the near-zero absorption 
rate (Smith et al., 1976). Thus, PP may be used to augment 
fluconazole if the developed azole resistance is determined 
to be associated with i(5L) gain variants originated from gut 
flora. 

Our analysis of the cancer pharmacogenomics data suggests 
that irinotecan may form an ET with EGFR inhibitors against hu- 
man cancers such as glioblastoma. Unlike other tumor types 
dependent on EGF signaling, glioblastoma responds poorly to 
EGFR inhibitors with the erlotinib response rate at <25% (Taylor 
et al., 2012). We envision that treatment with an EGFR inhibitor 
may further select for and thus increase the percentage of 
chr7-gained cells within the tumor mass. Supporting this idea, 
it was reported that EGF limitation selects for chr7 gain in human 
neural stem cells (Sareen et al., 2009). By analyzing existing 
pharmacogenomics data in brain tumor cell lines, we identified 
irinotecan, an FDA-approved chemical that targets topoisomerase 1,
as a drug whose potency significantly correlates with chr7
gain. Additional experiments will be required to further confirm 
this correlation and test the efficacy of an ET formed with 
a combination of gefitinib or erlotinib with irinotecan against 
glioblastoma. 



(C and D) Each of 453 genes located on chrXV was transformed into a diploid strain, and Z scores denoting the deviation of growth of each strain from the 
population average in the presence of 35 μg/ml hygromycin B were plotted against mRNA (C), using RNA-seq data, or protein expression abundance (D)
(Ghaemmaghami et al., 2003) of each tested gene in the euploid S288c background. Note that protein abundance data were not retrieved for 30% of genes
(including CRS5 and RPS15). The gray area shows the 95% confidence interval for the linear fitting. 

(E) Growth assays showing that copy-number increases (by genomic integration) of three genes (CRS5, RPS15, TRM11) on chrXV were individually sufficient in a
diploid euploid context to reproduce enhanced sensitivity to hygromycin B, but not radicicol resistance, contrasting copy-number increase for STI1 and PDR5 as 
shown in (A). 

(F) Growth assays showing that single-copy deletion of none of the three genes (CRS5, RPS15, TRM11) alone could rescue chrXV trisomy from hygromycin B
hyper-sensitivity. 

See also Figure S5. 
Figure 5. The Combination of Radicicol and Hygromycin B Extincts Karyotypically Heterogeneous Cell Population

(A-E) ChrXV trisomy was able to escape growth inhibition by hygromycin B through loss of the gained chrXV. (A) The growth (represented by OD reading on a 
Tecan reader) of both the euploid control and the trisomy XV strain was monitored in media containing 50 μg/ml hygromycin B. (B) The additional copy of chrXV
was lost in hygromycin B culture but not in YPD culture, as shown by the heatmap of karyotyping result of the final culture. (C) Karyotypes of six single colonies 
from the trisomy XV culture in YPD or hygromycin are shown, three of which were re-tested for growth in the presence of hygromycin B (D) or radicicol (E). Note 
that radicicol sensitivity was re-established in all three adapted colonies from the trisomy XV culture in hygromycin, whereas isolates from the YPD culture 
remained radicicol resistant. 

(F-H) Combination of hygromycin B and radicicol effectively eradicates the radicicol-preselected aneuploid population. (F) Growth curves (as OD600 measured in 
Tecan) of the diploid control strain under conditions as indicated. Note that 50 μg/ml hygromycin B alone had milder growth suppression compared to 100 μg/ml

Figure 6. PP Effectively Targets the Fluconazole-Resistant
Candida Aneuploid

(A) Relative IC80 (80% inhibitory concentration) of
the diploid versus the i(5L) Candida strain for each of 
the hits of the primary drug screen (Figure S7A). The 
error bars reflect the model-based standard error of 
fitting reported by drc package in R programming. 

(B) Images of agar plates showing effectiveness of 
PP toward i(5L) Candida. 

(C) Resistance of the diploid, the i(5L), or the 
i(5L)+diploid mix population toward fluconazole. 

(D) PP at concentrations above 10 μM restored the
sensitivity of the i(5L)+diploid mix population toward
fluconazole in the E-test, in accordance with its
single-agent activity against the i(5L) strain shown in
(B). Note that even though the initial plating density 
was the same, due to the inhibition of the i(5L) cells, 
the overall growth was less in (D) compared to (C). 
Note that our euploid strain also exhibited a reduced 
susceptibility to fluconazole compared to the clin- 
ical E-test standard strain, which may be attributed 
to other point mutations (such as the hyperactive 
TAC1) within this strain (Selmecki et al., 2006). All
plate images were taken after 48 hr culture. 

See also Figure S7. 



0.165 M MOPS buffer + 0.2% glucose without
bicarbonate (Lonza) was used for 96-well culture
according to the Clinical and Laboratory Standards
Institute (CLSI) standard M27-A3. For
384-well cultures, an additional 2% glucose was 
added to accelerate the growth. Agar plates 
were prepared using RPMI1640 with MOPS 
without bicarbonate powder (US Biological) sup- 
plemented with 2% glucose and 1.5% agarose, 
following the E-test strip manufacturer’s guideline 
(bioMerieux). 







EXPERIMENTAL PROCEDURES 
Cell Culture 

Budding yeast (strains are listed in Table S1, and plasmids/primers used to
construct these strains are listed in Tables S2 and S3) was cultured using 
standard media. C. albicans cells recovered from frozen glycerol stocks 
were grown on YPD plates. For bulk liquid culture prior to drug-sensitivity 
assays, SC media with additional 80 mg/l uridine were used. RPMI1640 +



Yeast Growth Assays 

For continuous OD monitoring (e.g., Figure 3D), 
cultures were set up in 96-well plates sealed 
with parafilm in a Tecan M200Pro reader with 
orbital shaking. OD at wavelength 595 nm 
was taken every 15 min and analyzed by using 
the Magellan 7 software (Tecan). For cultures 
whose OD reading was taken intermittently 
every several hours (e.g., Figure 5F), the culture 
plates were placed in a non-shaking incubator 
within a humid chamber before each reading 
after agitation. The OD at wavelength 595 nm 
was recorded by using a Tecan M200Pro reader, and the data files were pro- 
cessed in R. The growth assays lasted until the fastest growing culture 
reached saturation, at which time the last OD readings of all strains were 
recorded. 

The drug concentrations of hygromycin B and radicicol were adjusted for 
different media (e.g., YPD versus SC-ura) and/or the strain background, so 
that the wild-type control showed the same growth delay as was observed 
for the strain RLY2628 in YPD media containing the stated drug concentration. 






radicicol. (G) Growth curves of three populations pre-selected independently in the presence of radicicol (Figure S3E) under indicated conditions. (H) Growth
curves of the same three populations as in (G) under indicated conditions where each drug was combined with 50 µg/ml radicicol. Each data point in (G) and (H)
shows the mean and SEM from three experiments.

See also Figure S6. 
Figure 7. A Potential Drug Targeting chr7
Gain in Brain Cancer and Schematic Sum- 
mary of the Mechanism and Principle of ET 
using the Yeast Example 

(A) Correlation coefficients of drug response (IC50)
with chr7p dosage in 29 CNS tumor cell lines 
across 23 different therapeutic compounds were 
plotted as bar graphs. 

(B) The dot plot illustrates the details of the cor- 
relation between dosage of chr7p and sensitivity to 
irinotecan, with each dot showing the drug 
response and chr7p dosage of each cell line. The 
red line shows linear fitting, and the gray area 
shows the fitting range with 95% confidence interval.
Note that a total of 20 cell lines were
included here, as the IC50 data for nine cell lines
were not available for irinotecan.

(C) The molecular makeup of the ET against
aneuploid yeast.

(D) Schematic summary of how the opposing selective
effects of radicicol and hygromycin B on chrXV
gain impose an adaptive dilemma on the
heterogeneous aneuploid yeast population.

(E) An ET may be established against glioblastoma 
by opposing selective effects on chr7p gain. 





The growth of genetic variants was normalized to the corresponding wild-type 
controls. 

A comprehensive description of all methods used can be found in Extended 
Experimental Procedures. 
SUMMARY

Generation of potent antibodies by a mutation- 
selection process called affinity maturation is a key 
component of effective immune responses. Anti- 
bodies that protect against highly mutable patho- 
gens must neutralize diverse strains. Developing 
effective immunization strategies to drive their evolu- 
tion requires understanding how affinity maturation 
happens in an environment where variants of the 
same antigen are present. We present an in silico 
model of affinity maturation driven by antigen vari- 
ants which reveals that induction of cross-reactive 
antibodies often occurs with low probability because 
conflicting selection forces, imposed by different an- 
tigen variants, can frustrate affinity maturation. We 
describe how variables such as temporal pattern of 
antigen administration influence the outcome of 
this frustrated evolutionary process. Our calculations 
predict, and experiments in mice with variant gp120 
constructs of the HIV envelope protein confirm, that 
sequential immunization with antigen variants is 
preferred over a cocktail for induction of cross-reac- 
tive antibodies focused on the shared CD4 binding 
site epitope. 

INTRODUCTION 

Antibodies (Abs) with high affinity for antigen are produced by the 
process of affinity maturation (AM), which takes place in germinal 
centers (GCs). GCs are dynamic structures within secondary 
lymphoid tissues that arise in response to antigen stimulation 

(Shlomchik and Weisel, 2012; Victora and Nussenzweig, 2012). 
GCs house B cells, antigen-specific T helper cells that develop
in concert with GC B cells (Baumjohann et al., 2013; Kelsoe, 
1996) and antigens presented on follicular dendritic cells 
(FDCs) (Figure 1A). GC B cells enhance the antigen affinity of 
their receptors by 10- to 1,000-fold through cycles of mutation 
and selection against antigens presented on FDCs, a Darwinian 
evolutionary process that occurs on a very short timescale. Sol- 
uble forms of the high-affinity receptors are potent Abs. AM has 
been studied extensively using diverse experimental methods 
(Batista and Neuberger, 1998; Berek and Milstein, 1987; Berek 
et al., 1991; Eisen and Siskind, 1964; Jacob et al., 1991; Kocks 
and Rajewsky, 1988), mathematical models (Deem and Lee, 
2003; Kepler and Perelson, 1993; Meyer-Hermann, 2002;
Meyer-Hermann et al., 2006; Oprea and Perelson, 1997; Zhang
and Shakhnovich, 2010), and computer simulations (Keşmir
and De Boer, 2003; Shlomchik et al., 1998; Swerdlin et al., 
2008). Recent experiments have uncovered new aspects of 
GC dynamics (Allen et al., 2007; Shulman et al., 2013; Victora 
et al., 2010). 

Effective Ab responses are likely to be required for a protective 
prophylactic vaccine against highly mutable pathogens. For HIV, 
a quintessential example of such a pathogen, there has been no 
definitive success in designing such a vaccine, despite extensive 
efforts (Burton et al., 2012; Klein et al., 2013; Kwong et al., 2013;
Mascola and Haynes, 2013). This is in large part due to the vari- 
ability of HIV (Burton et al., 2012; Mascola and Haynes, 2013). 

The protective effects of an Ab are predicated on its ability to 
bind to a set of residues (the epitope) on the surface of virions. 
For example, the HIV-1 envelope glycoprotein trimer (Env) is 
the sole target of known HIV-1 neutralizing Abs (Burton et al., 
2012; Kwong et al., 2013; Mascola and Haynes, 2013; West 
et al., 2014). During the first few months of infection, the induced
neutralizing Abs are primarily strain-specific and target variable
regions of Env. As the concentrations of such strain-specific
Abs increase, neutralization escape variants with mutations 
in the variable epitopes are selected. A pathogen’s molecular 
Figure 1. Schematic Depiction of In Silico Model

(A) Players and processes in the GCR. 

(B) Major steps in our in silico model of the GCR. 

(C) Model for BCR-Ag interactions. Left: A BCR interacting with an FDC-held Ag. Right: A zoom-in view of interactions (bars) between the residues on the BCR 
paratope and those on the Ag epitope. An affinity-affecting mutation on a paratope residue will change its interaction strength h with the corresponding epitope 
residue, denoted by h_c (h_v) if the latter is conserved (variable).

(D) Conformational coupling between residues on the BCR is incorporated via correlated changes in h_c and h_v; weakening interaction with the variable residues
and the residues that shield the conserved residues of the epitope (red symbols), i.e., h_v ↓, would facilitate access to the conserved residues (green symbols),
i.e., h_c ↑.

See also Figure S1.



surfaces that interact with host receptors (e.g., the CD4 binding 
site) provide epitopes that contain a relatively conserved set of 
residues. They can serve as targets for Abs that are able to 
neutralize a greater diversity of HIV strains. A shield of glycans 
and immunodominant variable loops can restrict antibody ac- 
cess to these relatively conserved epitopes (Julien et al., 2013; 
Kwong et al., 2002; Lyumkis et al., 2013; Pancera et al., 2014; 
Wei et al., 2003; Wyatt et al., 1998). Yet, some HIV-1 infected in- 
dividuals do develop Abs that focus on such epitopes and 
neutralize a broad cross section of HIV strains in vitro (Kwong 
et al., 2013). However, these broadly neutralizing antibodies 
(bnAbs) are produced only beginning around 2 years after infec- 
tion and in only a limited fraction of chronically infected patients. 
Nonetheless, the emergence of bnAbs in some patients is proof 
that AM can result in bnAbs. This suggests the tantalizing possi- 
bility that appropriately designed immunogens and immuniza- 
tion protocols may be able to elicit bnAbs rapidly in a large frac- 
tion of individuals, thus resulting in an effective vaccine. 

Recent longitudinal tracking of a developing HIV-1 bnAb line- 
age and the co-evolving virus in a patient showed that extensive, 



and even specifically directed, viral diversification occurred prior 
to the development of breadth (cross-reactive Abs) (Gao et al., 
2014; Liao et al., 2013), supporting the idea that escape mutants
of the virus drive bnAb evolution. More importantly, this observa- 
tion highlights that induction of bnAbs will likely require immuni- 
zation with multiple variants of the antigen. 

When multiple complex antigen variants are used as immuno- 
gens, several new questions become important, for example: (1) 
which antigen variants should be used as immunogens; (2) what 
should be the concentrations and temporal order in which they 
are administered (e.g., cocktails versus sequential)? The an- 
swers to these questions are drawn from a huge number of 
possibilities, and random selection from a large number of com- 
binations of options may not allow sufficient sampling to find effi- 
cacious strategies. Intuition is unlikely to guide choices correctly 
because a mechanistic understanding of how AM occurs in the 
face of variant or mutating antigens is not available, as past 
studies have focused on AM in response to single model anti- 
gens. This gap in basic immunobiology needs to be addressed. 
A fundamental understanding of AM induced by several antigen 
variants that mimic the complexity of how conserved epitopes
are shielded on the intact viral spike could be harnessed to 
design optimal immunogens and immunization protocols for 
the development of universal vaccines against highly mutable 
pathogens (e.g., HIV-1, influenza). 

To take steps toward these goals, we developed an in silico 
model of AM induced by variant antigens. This stochastic dy- 
namic model enables examination of the key mechanisms and 
factors that influence AM in the face of variant antigens and 
the development of bnAbs. Our calculations predict markedly 
distinct outcomes if designed antigen (Ag) variants are presented 
in different concentrations and temporal patterns during im- 
munization. Experiments in mice using model HIV antigens are 
consistent with these predictions. Thus, our complementary 
computational and experimental results provide broadly appli- 
cable fundamental insights, and a guide for further studies aimed 
at overcoming roadblocks to the induction of bnAbs against HIV- 
1 by vaccination. 

RESULTS 

In Silico Model 

The purpose of our in silico modeling is not quantitative recapitu- 
lation of existing experiments but to provide fundamental mech- 
anistic insights into AM induced by multiple Ag variants and to 
compare the predicted relative efficacy of different immunization 
schemes in inducing cross-reactive broadly neutralizing Abs 
(bnAbs). Two key features not considered before must be incor- 
porated explicitly in such studies; viz., the presence of multiple Ag 
variants during GC reactions and the molecular complexity of 
the Ags. To mimic interactions of B cell receptors with complex 
immunogens such as the HIV-1 trimeric spike, we account for 
the fact that the conserved protein epitopes of desired Abs are
partially shielded, and that insertions and deletions in the variable 
loops can hinder the formation of strong interactions with the 
conserved residues of the epitopes (e.g., those in the CD4bs). 

We computationally simulate the dynamics of a typical GC 
(Figure 1A). The GC reaction (GCR) starts on day 3 (Nieuwenhuis
and Opstelten, 1984) after Ag injection, with three B cell blasts
(Jacob et al., 1991; Kroese et al., 1987) that barely meet a relatively
low-affinity threshold, E_a. The qualitative results we report
do not depend on the particular value of E_a since all other affinities
are scaled relative to it. The B cells expand without mutation,
reaching a population size of about 1,500 cells around day 7
(Jacob et al., 1991). 

Hypermutation in the Dark Zone of the GC 

At day 7, AID-mediated somatic hypermutation (SHM) in the
immunoglobulin (Ig) genes turns on (Kallberg et al., 1996) with a
rate of 10^-3 per base pair per division (Berek and Milstein,
1987). Variations in the time at which SHM begins do not affect
our results as we start timing AM processes after this point.
Each B cell in the dark zone of the GC (the region where this replication/mutation
occurs) replicates twice, and we assume that
mutation occurs uniformly, neglecting preferential replacements
or hypermutation hotspots (Wagner et al., 1995). We assume that
the activated B cells are from an appropriate germline (Kepler
et al., 2014). The probability of a functionally silent mutation
(no change in affinity) is p_s = 0.5, the probability that a mutation
is lethal (e.g., non-folding) is p_L = 0.3, and the rest are affinity-affecting
mutations (probability p_A = 0.2) (Shlomchik et al.,
1998). For affinity-affecting mutations, the affinity of a BCR (i)
with a particular type of antigen (j), E_ij, is changed. The extent
of the change is chosen from a probability distribution characterized
by a long tail of deleterious mutations that reduce affinity, as
experimental data suggest that favorable mutations are less
likely than deleterious ones (Figure S1A). Our BCR-Ag affinity
model is defined in a later sub-section.
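
The three-way classification of mutations described above can be sketched as a single random draw per mutation event. This is a minimal illustration, not the simulation code used in the study: only the probabilities p_s, p_L, and p_A come from the text, while the function names and the seeded random generator are hypothetical choices made for the example.

```python
import random

# Probabilities quoted in the text (Shlomchik et al., 1998).
P_SILENT = 0.5    # p_s: no change in affinity
P_LETHAL = 0.3    # p_L: e.g., non-folding receptor
P_AFFINITY = 0.2  # p_A: affinity-affecting (the remainder)


def classify_mutation(rng):
    """Return 'silent', 'lethal', or 'affinity' for one mutation event.

    Hypothetical helper: one uniform draw is partitioned by the
    cumulative probabilities p_s and p_s + p_L.
    """
    u = rng.random()
    if u < P_SILENT:
        return "silent"
    if u < P_SILENT + P_LETHAL:
        return "lethal"
    return "affinity"


def mutation_counts(n_events, seed=0):
    """Tally the outcome of n_events independent mutation events."""
    rng = random.Random(seed)
    counts = {"silent": 0, "lethal": 0, "affinity": 0}
    for _ in range(n_events):
        counts[classify_mutation(rng)] += 1
    return counts
```

For a large number of events, the tallies returned by `mutation_counts` should approach the 0.5 : 0.3 : 0.2 proportions stated above. The size of each affinity change would be drawn separately from the long-tailed distribution of Figure S1A, which is not reproduced here.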

Selection in the Light Zone of the GC 
B cells whose re-expressed surface Ig genes do not carry lethal 
mutations then go through affinity-dependent selection. Two 
survival signals are required for a B cell to be positively selected. 
First, the BCR on a B cell must bind to the Ag (immunogen) dis- 
played on the FDC sufficiently strongly to enable internalization. 
Since processes occur stochastically, in our simulations, a B cell
internalizes Ag with a probability related to the binding affinity of
its BCR for the Ag. The greater the difference between the affinity
(E_ij) of a B cell (i) for its epitope on Ag j and a threshold (E_a)
required for Ag internalization, the greater the probability of Ag 
internalization. This probability is described mathematically in 
analogy with a Langmuir isotherm (Equation 1 in Experimental 
Procedures). 
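
To make the Langmuir analogy concrete, the sketch below uses one plausible saturating form, P = x / (1 + x) with x = C exp(E_ij - E_a). This functional form is an assumption made for illustration; the exact expression is Equation 1 in Experimental Procedures, which is not reproduced in this section. Energies are taken to be in units of k_BT, and `conc` is a hypothetical dimensionless Ag concentration.

```python
import math


def p_internalize(e_ij, e_a, conc=1.0):
    """Assumed Langmuir-type probability that a B cell internalizes Ag.

    e_ij : affinity of B cell i for Ag j (assumed units of k_B*T)
    e_a  : affinity threshold for internalization
    conc : hypothetical dimensionless Ag concentration (> 0)
    """
    if conc <= 0:
        raise ValueError("conc must be positive")
    log_x = math.log(conc) + (e_ij - e_a)
    # Evaluate x / (1 + x) without overflow for large |log_x|.
    if log_x >= 0:
        return 1.0 / (1.0 + math.exp(-log_x))
    x = math.exp(log_x)
    return x / (1.0 + x)
```

Under this assumed form, a B cell exactly at threshold with `conc = 1` internalizes Ag half of the time, and the probability rises monotonically toward 1 as E_ij exceeds E_a, which matches the qualitative behavior described in the text.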

B cells that internalize Ag display antigenic peptides (p) bound 
to major histocompatibility complex (MHC) molecules on their 
surface. T cell receptors expressed on T helper cells can bind 
to these pMHCs to deliver a key survival signal. B cells compete 
with each other for the limited availability of T cell help (Victora 
et al., 2010). T helper cells are specific for peptides derived 
from the Ag, but they are responsive to diverse pMHCs, not 
just those derived from the epitope targeted by the BCR. Thus, 
we assume that the probability of a B cell receiving T cell help 
is dictated by the amount of internalized Ag, regardless of the 
identity of its targeted epitope. A B cell that successfully internal- 
izes Ag receives T cell help with a probability that depends upon 
its probability of internalizing Ag relative to the average probabil- 
ity of internalizing Ag of all the other B cells present in the GC 
(Equation 2 in Experimental Procedures). A B cell with a high af- 
finity for its target epitope competes better for T cell help. 
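
The competition for T cell help can be illustrated with a simple ratio. The form below, P_help = P_i / (P_i + mean of P over the other B cells), is an assumption chosen only because it has the property described in the text; the expression actually used is Equation 2 in Experimental Procedures, which is not shown in this section. The function name is hypothetical.

```python
def p_t_help(p_int, index):
    """Assumed probability that B cell `index` receives T cell help.

    p_int : list of Ag-internalization probabilities, one per GC B cell
    index : position of the B cell of interest in p_int
    """
    others = p_int[:index] + p_int[index + 1:]
    if not others:
        # A lone B cell has no competitors (illustrative convention).
        return 1.0
    mean_others = sum(others) / len(others)
    denom = p_int[index] + mean_others
    return p_int[index] / denom if denom > 0 else 0.0
```

With this sketch, a B cell whose internalization probability equals the average of its competitors is helped half of the time, and a cell that internalizes Ag more readily than its peers competes better, as stated above.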

To study AM against Ag variants that may be present simulta- 
neously, we have to confront an issue for which no experimental 
information exists: when a GC B cell encounters an FDC, with how
many types of Ag variants can it simultaneously interact? We 
consider two scenarios: (1) each GC B cell interacts with one 
type of Ag variant during an encounter, or (2) each GC B cell in- 
teracts simultaneously with all Ag variants held on the FDCs. We 
find that which of these scenarios describes the heterogeneity of 
Ag display on FDCs strongly influences the outcome of AM, indi- 
cating the importance of experimental interrogation of this issue. 

We study two extreme cases of competition for T cell help: (1)
GC B cells compete only with contemporary GC B cells (“peers 
only”) and (2) in addition to current GC B cells, all antibodies 
generated in previous rounds of mutation/selection also partici- 
pate in the competition (“Ab feedback”). Reality should be 
bounded by these scenarios. 

Recycling, Differentiation, and Termination of the GCR

Following seminal studies (Oprea and Perelson, 1997), 90% of
the selected cells are recirculated to the dark zone. The rest
differentiate into equal numbers of Ab-secreting plasma cells
and memory cells that can re-expand upon future activation. 
Ab feedback is provided by objects in our agent-based model 
with affinities representing these past GC emigrants. As AM pro- 
ceeds, at first, the number of B cells in the GC decreases. In 
some cases, all B cells apoptose and the GC extinguishes. In 
others, upon reaching a population bottleneck, favorable clones 
emerge, and the number of B cells rises (Zhang and Shakhnovich,
2010). We terminate the GCR once the GC population recovers
to the initial size (i.e., ~1,500 cells), or when an assumed
maximum duration of 120 days (or 240 GCR cycles) is reached, 
whichever comes first. The first condition reflects the fact that an 
abundance of GC B cells will internalize all the Ag on FDCs. The 
second condition may reflect antigen decay over time which we 
do not model explicitly. 
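
The recycling split and the two termination conditions can be summarized in a short sketch. The numbers (90% recycling, ~1,500 cells, 240 cycles) are taken from the text; the function names, the integer rounding, and the `passed_bottleneck` flag are hypothetical. The flag reflects our reading that "recovers" implies the population has first dropped below its initial size.

```python
INITIAL_SIZE = 1500  # GC population at the onset of SHM (from the text)
MAX_CYCLES = 240     # 120 days, i.e., two GCR cycles per day


def split_selected(n_selected):
    """Split positively selected B cells into recycled, plasma, and memory cells.

    90% return to the dark zone; the rest differentiate into (nearly)
    equal numbers of plasma and memory cells. Integer rounding is an
    illustrative choice.
    """
    recycled = (9 * n_selected) // 10
    exported = n_selected - recycled
    plasma = exported // 2
    memory = exported - plasma
    return recycled, plasma, memory


def gcr_status(population, cycle, passed_bottleneck):
    """Return 'extinct', 'recovered', 'timeout', or 'running'."""
    if population == 0:
        return "extinct"  # all B cells apoptosed
    if passed_bottleneck and population >= INITIAL_SIZE:
        return "recovered"  # first termination condition
    if cycle >= MAX_CYCLES:
        return "timeout"  # second termination condition
    return "running"
```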

Model for BCR-Ag Affinity 

In classic computational studies focused on GCR stimulated by a 
single model Ag, B cells were binned into different affinity classes 
for the Ag (e.g., Kepler and Perelson, 1993). The correlated affin- 
ities for different Ag variants require a more detailed description. 
Other studies have considered mathematical models (such as 
NK-models) for BCR properties and their Ag affinity that present 
rugged landscapes for affinity evolution (Deem and Lee, 2003). 
To unambiguously define affinity to different Ag variants and to 
consider complex immunogens that reflect features on viral 
spikes, we developed a coarse-grained model with “residue- 
level” resolution for key BCR-Ag interactions (Figure 1C). Our 
model was inspired by the CD4bs on the trimeric HIV-1 viral spike, 
which is targeted by many monoclonal bnAbs for HIV-1 (Burton 
et al., 2012). But, it applies to other epitopes that contain highly 
conserved residues and can easily be modified to consider other 
pathogens. We ignore distracting epitopes that do not contain 
conserved residues because they are less likely to elicit bnAbs. 

BCRs can potentially make contacts with three types of resi- 
dues on the viral spike: (1) highly conserved residues of the 
epitope (18 such residues in model); (2) residues representing 
motifs, such as glycan attachment sites, which, when occupied, 
can be associated with sterically blocking access to the 
conserved residues (6 such residues in model); and (3) variable 
residues, which upon mutation (includes insertions/deletions), 
can further mask the conserved residues of the epitope. We 
include 22 such residues from the variable loops of Env that 
mimic the most mutable sites in the Seaman neutralization test 
panel (Seaman et al., 2010; Walker et al., 2011) sequences and 
publicly available Env sequences. 

Each epitope residue is described in a coarse grained manner, 
such that it is either the wild-type (WT) amino acid or a mutant. 
The strength of interaction of a residue (k) on BCR (i) with a residue
on the epitope is denoted by h_k^i. The binding affinity, E_ij, between
a B cell clone h^i and a viral strain s^j is modeled as

$$E_{ij}(\vec{h}^{i}, \vec{s}^{j}) = \sum_{k=1}^{M} h_k^{i} s_k^{j} + \sum_{k=M+1}^{N} h_k^{i} \qquad (1)$$

The first M interactions (i.e., k ≤ M) are with variable contact
residues on the viral spike, where s_k can be either 1 (WT) or -1
(mutated). The other (N-M) sites (i.e., k > M) are conserved residues
on the epitope with s_k = 1. The interaction strength h_k is



drawn from a continuous and uniform distribution within a 
bounded range (details in “Simulation Methods” in Extended 
Experimental Procedures). 
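
As a worked illustration of Equation 1, the sketch below sums the interaction strengths over the M variable sites, each weighted by s_k = +1 (WT) or -1 (mutated), and adds the unweighted strengths for the N-M conserved sites. The function name and the list-based representation are hypothetical, and the conformational coupling discussed in the following paragraphs is not included.

```python
def binding_affinity(h, s_variable):
    """Evaluate Equation 1 for one BCR and one Ag variant.

    h          : interaction strengths h_k for all N paratope residues;
                 the first M entries face variable epitope residues.
    s_variable : the M entries s_k of the viral strain, +1 (WT) or -1 (mutated).
    """
    m = len(s_variable)
    if m > len(h):
        raise ValueError("more variable sites than interaction strengths")
    if any(s not in (1, -1) for s in s_variable):
        raise ValueError("each s_k must be +1 or -1")
    variable_part = sum(hk * sk for hk, sk in zip(h[:m], s_variable))
    conserved_part = sum(h[m:])  # s_k = 1 for conserved residues
    return variable_part + conserved_part
```

For example, with h = [0.5, 0.2, 1.0] and two variable sites, the WT strain (s = [1, 1]) gives an affinity of 1.7, whereas mutating the second site (s = [1, -1]) lowers it to 1.3. Only the contribution of the conserved residue is the same for both strains.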

Equation 1 is, however, just a starting point for the affinity be- 
tween a particular BCR and Ag, as BCRs do not interact with 
peptide chains in a linear fashion, and interactions between res- 
idues are not independent, as Equation 1 may imply. Epitope- 
paratope interactions are distinctly 3-dimensional, and structural 
aspects of CD4bs bnAbs also point to the importance of how in- 
teractions with some residues on the viral spike might influence 
interactions with other epitope residues (Zhou et al., 2010). For 
instance, most CD4bs bnAbs avoid contact with almost the 
entire V1/V2 loop except for a few conserved residues near the
stem, and the very potent VRC01 Ab avoids the V5 loop. This 
suggests that avoiding interactions with some residues can allow 
better access and stronger interactions with the conserved res- 
idues of the epitope. Furthermore, affinity enhancement of 
bnAbs (such as VRC01) is influenced by alteration of non-contact
residues (Klein et al., 2013). We account for these effects as 
described below (mathematical details in “Simulation Methods” 
in Extended Experimental Procedures). 

BCR mutations that strengthen (weaken) interactions with the 
residues that shield the conserved residues could result in a 
decreased (increased) binding strength (value of h in Equation 
1) for a randomly chosen paratope residue that can potentially 
interact with a conserved residue on the epitope. This feature re- 
flects the fact that BCRs that decrease contacts with shielding 
residues are more likely to be able to access and make contacts 
with the protected conserved residues. 

Current knowledge on Env structures (Julien et al., 2013; 
Lyumkis et al., 2013; Pancera et al., 2014) indicates that V1 is
likely to be a dynamic, unfolded, and disordered flexible loop, 
and mutations in V2, especially insertions/deletions, can hide 
the conserved residues of a neutralizing epitope. So, paratope 
alterations that weaken interaction with a mutated variable 
loop residue result in an increased binding strength (value of h 
in Equation 1) for a randomly chosen paratope residue that can 
potentially interact with a conserved residue on the epitope
(Figure 1D), and vice versa.

Choice of Immunogens and Immunization Schemes 

In silico, we study three Ag variants, the WT Ag (only unmutated 
residues) and two mutants. Of the 22 most mutable residues in 
the variable loops, 20 residues are mutated in the majority of 
the 141 Seaman test panel sequences (Figure S1B). As these
highly mutated strains are viable, to maximize the number of
non-overlapping mutated residues on the Ag variants, we studied
two mutant strains with 11 non-overlapping mutations in the variable
sites. We also studied variants with 4 and 8 such mutations.
We assume that these Ag variants are not so distal in sequence 
space that they stimulate completely different B cell lineages. 

We investigate three immunization schemes in silico: (1)
scheme I (WT+v1+v2): WT Ag and two variants administered
as a cocktail; (2) scheme II (WT|v1+v2): immunization with WT
Ag first, followed by administration of the two variants simultaneously;
and (3) scheme III (WT|v1|v2): immunization with WT Ag first,
followed by sequential administration of the two variants. In our
murine experiments we studied schemes I and III.
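
The three schemes differ only in how the same three antigens are grouped into successive immunization stages, which can be encoded compactly. In this illustrative sketch each scheme is a list of stages and each stage is a tuple of co-administered antigens; the data structure and the helper name are hypothetical, while the labels it produces follow the notation used above ("+" within a stage, "|" between stages).

```python
# Each scheme is an ordered list of stages; each stage lists the
# antigens administered together.
SCHEMES = {
    "I": [("WT", "v1", "v2")],           # cocktail
    "II": [("WT",), ("v1", "v2")],       # WT first, then both variants
    "III": [("WT",), ("v1",), ("v2",)],  # fully sequential
}


def scheme_label(stages):
    """Render a scheme in the WT|v1+v2 notation used in the text."""
    return "|".join("+".join(stage) for stage in stages)
```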
Figure 2. Concentration Dependence of GC
Survival and Antibody Breadth in “WT First”
Schemes

(A-C) For a relevant range of Ag concentrations
(C_tot), the fraction of surviving GCs (%surv, blue)
and their Ab breadth (red) are shown for (A)
scheme II (see 1 Ag), (B) scheme II (see both Ag),
and (C) scheme III, with full (left column) or no
(right column) Ab feedback. See also Figures S2
and S3.





For each immunization protocol, we simulate many GC reac- 
tions. When all steps of immunization and AM are completed, 
we check whether the affinity of each Ab produced by a GC ex- 
ceeds a threshold value against each strain in the Seaman panel. 
Thus, we assess the breadth of cross-reactivity. The breadth of 
coverage is defined as the fraction of test panel sequences to 
which an Ab binds with an above-threshold affinity. We collect 
statistics from many Abs and simulations, and report results 
as histograms or probabilities of obtaining Abs with a certain 
breadth. As many GCs are induced in each vaccinated person, 
these results reflect the probability of obtaining Abs that exhibit 
particular breadths of coverage in a typical individual. 
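
The breadth of a single Ab, as defined above, is simply the fraction of test panel strains that it binds above threshold. The following minimal sketch assumes the affinities against the panel have already been computed; the function name and the example values are hypothetical.

```python
def breadth(affinities, threshold):
    """Fraction of panel strains bound with above-threshold affinity.

    affinities : affinity of one Ab for each strain in the test panel
    threshold  : affinity value that must be exceeded to count as binding
    """
    if not affinities:
        raise ValueError("the test panel must contain at least one strain")
    bound = sum(1 for e in affinities if e > threshold)
    return bound / len(affinities)
```

For instance, an Ab with affinities [1.0, 2.0, 3.0, 4.0] against a four-strain panel and a threshold of 2.5 has a breadth of 0.5.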

Model Reproduces Known Features of Affinity 
Maturation with a Single Antigen 

Immunization with only the WT Ag leads to affinity enhancement 
on conserved and variable residues alike (Figure S1E). The number
of accumulated mutations (seven affinity-affecting mutations
in 2 weeks, Figure S1D) and the incremental
changes in interaction strength
(Figure S1E) are consistent with experiment
(Berek et al., 1991; Kocks and Rajewsky,
1988; Wedemayer et al., 1997).
Higher affinity clones continuously
emerge, producing potent Abs (Figures S2C
and S2F).

The Effect of Antigen 
Concentration 

For a single Ag, if Ag dose is too low (Fig- 
ures S2A-S2B and S2D-S2E, lowest 
C_wt), B cells are unlikely to be selected
during GC reactions, the GC collapses,
and there is no Ab production. If Ag
dose is too high (Figures S2A-S2B and
S2D-S2E, highest C_wt), selection is
easy, there is little competition between 
B cells, and GCs are rapidly filled with 
low-affinity clones (Eisen and Siskind, 
1964; Goidl et al., 1968). Our agent- 
based stochastic approach naturally 
reproduces the inverse correlation be- 
tween Ag concentration and heterogene- 
ity in Ab affinities (Figures S2G and S2H) 
observed in early experiments (Eisen and 
Siskind, 1964). 

For the three immunization schemes with antigen variants, Ag 
concentration is again a very important variable (Figure 2). If the 
Ag concentration is too low, most GCs collapse as expected. 
However, there is a curious decline in GC survival and Ab 
breadth if the Ag concentration is too high for immunization 
schemes II and III. During AM after immunizing with a high 
dose of WT Ag, many B cells survive easily and the GC reaction 
draws to an end quickly. Therefore, these B cells are largely low- 
affinity clones that have not accumulated mutations that 
enhance contacts with the conserved residues of the epitope. 
So, upon administering either a cocktail of the two Ag variants 
or just one variant, these B cells have a small chance of binding 
sufficiently strongly with the mutants and surviving, and so GCs 
collapse despite the abundance of Ag. The few Abs produced do 
not develop breadth as the large number of mutations required to 
confer breadth do not evolve (see examples in Figure S3). 

Hereafter, for each immunization scheme, we show results for
the Ag concentration (listed in Table 1) that yields Abs with a 
Table 1. Properties of Antibody Responses Produced by Various
Immunization Schemes 
%surv: percentage of seeded GCs that produce antibodies efficiently;
Breadth: defined in text; h_c (h_v): the average strength of interactions
with conserved (variable) residues of the epitope (large values of h_c indicate
focusing on the conserved residues of the epitope); C_tot: antigen
concentration.



near-optimal breadth (peak of red curves in Figure 2). In scheme
III, the chosen concentration is slightly higher to ensure high Ab 
production. 

Immunizing with a Cocktail of WT Ag and Two Variants 
Fails to Elicit an Effective Response 

Figure 3A shows in silico results for the distribution of the breadth 
of coverage (for the Seaman panel) of Abs produced using im-
munization scheme I (cocktail of WT+v1+v2). Ag concentration 
is fixed to yield the greatest breadth, but two more variables 
could be important: during selection in the light zone, does a B 
cell-FDC encounter involve one or all Ag variants? Is “Ab feed- 
back” important during AM? For this immunization scheme, in 
most scenarios, just a few percent of GCs survive (Table 1); 
i.e., Ab titers are predicted to be low. To see why, consider first 
the situation where B cells interact with only one Ag variant dur- 
ing each encounter with FDCs. B cells that bind moderately 
strongly to one of the three variants seed the GC, and they do 
not have any strong interactions with the conserved residues 
of the epitope yet. Now consider a B cell that is selected by a 
particular Ag variant in an early round, then mutates, and returns 
to the light zone to encounter a different variant. Without even 
moderately strong contact with the conserved residues, the 
probability that such a B cell is cross-reactive to the new variant 
is small. Thus, apoptosis is the most likely outcome, leading to 
collapse of the GC. For the rare GCs that do survive, there is evolution
of interactions with the conserved residues and diminution
of interactions with variable loops during AM, and this confers 
some breadth (relative to immunization with WT Ag alone, see 
Table 1). 



[Figure 3 panels: histograms with axes labeled Breadth and Coverage]

Figure 3. Distribution of Breadth of Individual Antibodies and Their
Total Coverage for Surviving GCs in Scheme I (WT+v1+v2)

(A and B) Histograms of breadth for having 11 (A) or 4 (B) non-overlapping 
mutations in the two variants. Histograms are obtained as a distribution for all 
the surviving GCs. 

(C) Histograms of coverage for surviving GCs under the same conditions as in 
(B). The coverage is defined as a sum of non-overlapping specificities. Results 
are shown for the cases of “see 1 Ag” (filled bars) and “see all Ag” (unfilled 
bars), with Ab feedback. 

See also Figure S4. 



If B cells interact with all Ag variants during each encounter 
with FDCs, and the Ag concentration is chosen to be the one 
that confers optimal breadth, the percentage of surviving GCs 
and the distribution of breadth are not very different compared
to the case where only one Ag variant is encountered at a time 
(Figure 3A). However, importantly, the optimal concentration is 
lower when B cells can interact with all Ag variants. If the Ag con- 
centrations were made the same as when B cells interact with 
one Ag at a time, then most GCs would survive when all Ag 
variants are simultaneously encountered (Figure S4B). This is 
because, in this case, B cells that bind sufficiently strongly to 
Figure 4. Comparison of Statistics for Surviving GCs in Scheme II (WT|v1+v2) and Scheme III (WT|v1|v2)

(A-F) Shown are histograms for breadth, mean interaction strength with conserved residues (h_c) and with variable residues (h_v) (A-D), and for total coverage (E and
F) via a polyclonal response. Results are shown for “Ab feedback” (A, C, and E) and “peers only” (B, D, and F) scenarios. For scheme II (A and B), cases of “see 1
Ag” (filled bars) and “see both Ag” (unfilled bars) are shown. Total coverage (E and F) is presented for scheme III (WT|v1|v2) by red filled bars and for scheme II
(WT|v1+v2, see 1 Ag) by blue unfilled bars. See also Figure S5.



any one Ag variant can progress through AM. As a consequence, 
the generated Abs are likely to be strain-specific. Thus, if Ag is
displayed homogeneously on FDCs, for immunization scheme 
I, optimal conditions for inducing Abs efficiently at high titers 
would not induce bnAbs. The lower Ag concentration required 
for optimal breadth results in the survival of only a relatively 
low fraction of GCs (Figure 3A). 

We also carried out calculations where the Ag variants differ 
from each other by only four non-overlapping mutations. When 
all variants are simultaneously encountered in each interaction 
with FDCs and Ab feedback is turned on, comparing results for 
this case (Figure 3B) with that when the variants differ by 11
mutations (Figure 3A), we find that a much higher fraction 
(10% versus 1%) of GCs succeed in producing Abs (Fig- 
ure S4D). This is because the Ags vary less from each other, 
and so the chance of a B cell being cross-reactive to variants 
in early stages of AM is higher, thus increasing the probability 
of it being positively selected. The breadth of the Abs pro- 
duced is not large because many variable residues that are 
unmutated in the Ag variants are mutated in the test panel 
sequences, and strain specific Abs are likely to be produced 
as all Ag variants on FDCs are simultaneously encountered. 
This is why the total coverage (Figure 3C), defined as the 
breadth of coverage offered by the polyclonal response from 
all the Abs, is higher than the average breadth of coverage 
of individual Abs (Figure 3B). Results for the case where only 
one Ag variant is encountered at a time on FDCs are shown 
in Figures 3B and 3C (filled bars), and for no Ab feedback 
see Table 1.
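
The difference between the breadth of individual Abs and the total coverage of a polyclonal response can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that each Ab either covers or does not cover each test panel sequence; the coverage matrix and the function names are hypothetical and are not taken from the simulations.

```python
from typing import List

def breadth(covers: List[bool]) -> float:
    """Fraction of test panel sequences covered by a single Ab."""
    return sum(covers) / len(covers)

def mean_breadth(coverage: List[List[bool]]) -> float:
    """Average breadth over all Abs (one row per Ab)."""
    return sum(breadth(row) for row in coverage) / len(coverage)

def total_coverage(coverage: List[List[bool]]) -> float:
    """Fraction of panel sequences covered by at least one Ab."""
    n_panel = len(coverage[0])
    covered = [any(row[k] for row in coverage) for k in range(n_panel)]
    return sum(covered) / n_panel

# Hypothetical coverage matrix: 3 strain-specific Abs x 4 panel sequences.
coverage = [
    [True,  True,  False, False],
    [False, False, True,  False],
    [True,  False, False, False],
]

print("mean breadth  :", mean_breadth(coverage))
print("total coverage:", total_coverage(coverage))
```

Because strain-specific Abs cover different subsets of the panel, the union of their coverage is never smaller than the breadth of any single Ab.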



In our in silico studies of this immunization scheme, we did 
not find any conditions that result in bnAbs with high probability 
(Figure S4). This is because B cell evolution during AM is made 
difficult by the potentially conflicting selective pressures 
imposed by Ag variants simultaneously displayed on FDCs. 
We term this phenomenon frustrated AM. The degree to which
AM is frustrated is controlled by a complex interplay between 
Ag concentration, heterogeneity of Ag display on the FDCs, the 
number of mutations that separate the Ag variants, and the 
extent to which Ab feedback is important. 

Immunizing with WT Ag First and then a Cocktail of the 
Two Variants Yields Abs with Significant Breadth in 
Special Circumstances 

Consider first the situation in which there is no Ab feedback and 
only one Ag variant is encountered at a time on FDCs (Figure 4B 
filled bars). In the first period of AM induced by the WT Ag, 
moderately strong contacts with the conserved residues of the 
epitope can evolve. During AM following immunization with the 
cocktail, individual B cells have equal chance of encountering 
either Ag variant during selection. Since the two variants have 
non-overlapping mutated residues, beneficial BCR mutations 
for one strain are deleterious for the other. Maturing B cell line- 
ages are frustrated in satisfying these conflicting requirements 
for selection in successive rounds, but less so than in scheme I 
because moderately strong interactions with the conserved residues
have evolved during AM driven by the WT Ag prime, resulting
in some cross-reactivity. Therefore, a greater fraction (~10%)
of GCs survive compared to scheme I. Since a relatively high Ag
concentration is required for GC survival in this case (Figure 2A
right panel blue curve), B cells can expand readily and GCRs 
terminate quickly. Short maturation times limit the average 
breadth (~35%, Figure 2A right, red curve), but Abs with large 
breadth do evolve with low probability. 

If there is Ab feedback, lower Ag concentrations still allow GCs 
to mature successfully. This is because Abs produced in previ- 
ous rounds of AM often have lower affinities for the encountered 
Ag than the best clones produced during the current round of 
mutation and selection. This confers a competitive advantage 
to these best B cell clones because they are not just competing 
with each other, but also with the weaker affinity Abs (“Simula- 
tion Methods” in Extended Experimental Procedures). Also, 
the Abs compete effectively with unfavorable mutant cells that 
may emerge and be stochastically selected only to be extin- 
guished in a future round of mutation-selection. Lowering Ag 
concentration diminishes the probability of selection in each 
round, thus allowing AM to proceed for a longer time before all 
the Ag on FDCs is internalized. This allows mutations that confer 
breadth to accumulate. This balance of lowering frustration 
through Ab feedback and increasing frustration by reducing Ag 
concentration results in a very high survival rate for GCs and 
considerable breadth in the Ab response (Figure 2A left, Fig- 
ure 4A filled bars). 

If multiple Ags are encountered during each interaction with 
FDCs, the outcome is different (Figures 4A and 4B, unfilled 
bars). Since a choice of being selected by the same variant 
(rather than a randomly chosen one of the two) is always avail- 
able in successive rounds of AM, the frustration is low. In most 
cases, we find that B cells are selected by one of the variants 
repeatedly, and so strain-specific Abs are likely to evolve (Table 
1). Furthermore, access to both Ag variants allows B cells to 
multiply successfully in the GC and quickly consume all the Ag 
on FDCs. The duration of AM is short, so mutations that confer 
breadth evolve rarely, and the average breadth is no more than 
40%, regardless of the extent of Ab feedback (Table 1). For 
this optimal breadth to develop, the antigen concentrations 
must be much lower than when only one Ag variant is seen during 
each B cell-FDC encounter (Figure 2B versus 2A). 

Our results suggest that this immunization scheme has the po- 
tential to produce bnAbs (~75% breadth) with high probability if 
B cells encounter one Ag variant at a time on FDCs and all Abs 
produced in previous rounds migrate through ongoing GCs. 
Rather than the extreme situations of full Ab feedback or none 
at all, there is some middle ground, and also most likely, B cells 
encounter one FDC Ag variant in some rounds of selection and 
all variants during others. Thus, our results suggest that, depending
upon circumstances, this immunization scheme produces
Abs efficiently with a probability between 10% and 80%,
and with average breadths ranging from 30% to 80% (Table 1). 
This scheme may be analogous to what ensues in patients 
who develop bnAbs upon natural infection as they are first 
exposed to the infecting strain, which then diversifies. However, 
the sensitivity of our results to varying conditions suggests that
it may be quite difficult to induce bnAbs consistently in diverse 
patients using this immunization scheme. This is because the 
conditions noted above are likely to vary between and within 
individuals. 



Sequential Immunization with Antigen Variants Leads to 
Efficient Induction of an Antibody Response with Broad 
Specificity 

Sequential immunization with the three Ag variants temporally 
separates the mutually conflicting selective driving forces 
imposed by multiple variants. Confounding effects associated 
with whether one or multiple types of Ags participate in each B 
cell-FDC encounter in the GC are also obviated. 

Consider first the situation where there is no Ab feedback 
(Figure 4D). Maturation against the WT Ag results in some 
strong interactions with the conserved residues. When AM en- 
sues with the first variant, two main types of lineages evolve 
in the simulations (Figure S5). Both types of lineages tend to 
reduce interactions with the residues that are mutated in the first 
variant Ag. But, in one type of lineage, interactions with the un- 
mutated variable residues in the first variant (which are mutated 
in the second variant) are enhanced; in the other type of lineage, 
these interactions do not change much. When the second 
variant with non-overlapping mutations is introduced, the latter 
lineages have a good chance of outcompeting the former ones 
during AM. This is because clones that did not increase their 
footprint on the variable residues that are now mutated in the 
second variant can simply reduce interactions with these resi- 
dues, and focus strongly only on the conserved residues of 
the epitope. This is why Abs produced in scheme III exhibit a 
narrow distribution of very large breadth (Figure 4D top panel) 
rather than a broad distribution of Abs with moderate breadth 
seen in the analogous situation using scheme II (Figure 4B top 
panel, filled bars). In scheme II, lineages specific for one or 
the other variant strain have a higher chance of survival. In 
scheme III, Abs that focus contacts with the conserved residues 
of the epitope and minimize interactions with all the variable res- 
idues are more likely to emerge (Figures 4B and 4D, middle 
panels). 

Comparing “Ab feedback” (Figure 4C) and “peers only” sce- 
narios (Figure 4D) in scheme III, we see that Ab feedback leads 
to narrower distributions of Abs with large breadth, due to the 
same effects of Ab feedback noted for scheme II. As shown in 
Figures 4E and 4F, the distributions of total coverage for Abs pro- 
duced by schemes II and III are very close when there is Ab feed- 
back (Figure 4E), and the average values are both as high as 
90%. If there is no Ab feedback (Figure 4F), however, scheme 
III gives a similar broad coverage (89%), whereas scheme II 
yields a moderate coverage (51%) even with polyclonal re- 
sponses. So, scheme III is predicted to be more robust to 
many varying conditions. 

Experiments in Mice with a Model System Show that 
Antibodies that Focus on the Conserved Residues 
Emerge upon Sequential Immunization, but Not upon 
Immunization with a Cocktail 

Our calculations predict that, compared to the other immuniza- 
tion schemes we studied, truly cross-reactive Abs that 
focus contacts on the conserved residues of the epitope with 
a small footprint outside are most likely to emerge robustly 
upon sequential immunization with Ag variants. Immunization 
with a cocktail of Ags is predicted to result in a significantly 
worse outcome compared to sequential immunization. We 
Figure 5. Immunization of Mice with Variant Immunogens

(A) Schematic of an Fc-gp120 immunogen.

(B) Immunization groups and dosing schedule for mouse experiment.

(C-E) Serum titrations (day 48) of a representative mouse from each group, (C) sequential, (D) parallel, and (E) core-only, on yeast displaying stripped core (black),
clone A (blue), clone B (red), or clone C (green). Each plot also includes serum from an unimmunized mouse binding to yeast displaying clone C (black dashed).
Binding data are fit to a monovalent binding isotherm of the form $y = y_{\max}\,\mathrm{dil}^{-1} / (\mathrm{dil}^{-1} + K^{-1})$, where “dil” is the serum dilution and y is the binding signal in MFU.
Discussion of the fitted parameters can be found in the supplemental material.

See also Figure S6 and Table S1.



note previous immunization studies that have suggested the ad- 
vantages of sequential immunization of HIV Env in generating 
cross-reactive Abs, in further agreement with our computational 
results (Malherbe et al., 2011; Pissani et al., 2012). Here, we 
tested our predictions comparing sequential immunization and 
cocktails in mouse studies using precisely engineered Ag vari- 
ants. It is important to note that the variant Ags we have engi- 
neered are gp120 monomers of Env, not trimers. Thus, they 
cannot be expected to produce bnAbs that neutralize virus par- 
ticles with intact trimeric spikes. The goal of our experiments was 
to precisely test whether cross-reactive Abs that focus on the 
conserved residues of an epitope (CD4bs) are more likely to 
develop upon sequential immunization, compared to administra- 
tion of a cocktail of the same immunogens, as per our in silico 
predictions. 

As described in Extended Experimental Procedures, we engi- 
neered variants of a minimized gp120 core protein immunogen 
(the “stripped core”) that incorporate diversification at 43 sur- 
face residues outside the CD4bs, while retaining binding affinity 
to the VRC01 bnAb (Table S1, Figure S6). These immunogens
are designed to present biochemically novel residues outside 
the CD4bs such that only the desired CD4bs epitope is con- 
served across all the immunogens. We used four engineered var- 
iants (including stripped core) that fit these criteria. Our in silico 
results should hold irrespective of whether three or four variants 
are used. 



BALB/c mice were immunized intranasally with HEK293-pro- 
duced immunogens fused to mouse Fc (Figure 5A) with a CpG 
oligonucleotide adjuvant, following protocols described else- 
where (Ye et al., 2011). Immunogens were administered every 
2 weeks at a total protein dose of 50 pmol. Animals were divided 
into three groups (Figure 5B). The “Sequential” group of four 
mice was immunized sequentially with each of the four Ag vari- 
ants. The “Parallel” group of four mice received a cocktail of 
all four variants at each administration. The “core-only” group 
of two mice was given three doses of the same stripped core 
immunogen. Serum was collected every week. 

Serum binding to the Ag variants was assayed by flow cytom- 
etry of yeast displaying each of the four variants. Representative 
curves for each of the three immunization groups are shown in 
Figures 5C-5E. Sequentially immunized mice exhibit similar 
binding to each of the four Ag variants, consistent with serum 
that recognizes and focuses contacts with a shared epitope pre- 
sented on all variants (Figure 5C). Mice immunized with a cocktail 
of four variants show a broader spread of binding affinities, 
consistent with dominant serum specificities for some variants 
but not others (Figure 5D). In the mouse represented, for 
example, it appears that the serum recognizes the epitope on 
stripped core that is not present on clones A or C. Serum from 
mice immunized only with stripped core binds strongly to stripped
core but not to the other immunogens, consistent with our immu- 
nogen design objective that the four variants not share common 
Figure 6. Determination of Serum Specificity for the CD4 Binding Site

(A) Fractional binding to D368R mutant versus stripped core of serum from each group after three (open markers) and four (filled markers) immunizations. The
fractional binding to D368R for antibody VRC01 at various concentrations is shown as lines: 0.21 at 93 nM (dashed line), 0.10 at 75 nM (dash-dotted line), and 0.06
at 8.2 nM (dotted line).

(B) Schematic and data from ternary complex assay in which gp120 (CD4bs shown in gray) is pre-incubated with serum then added to VRC01 yeast. The formation
of a ternary complex is only possible if VRC01 is able to bind to its epitope on gp120 after serum has bound. The figure shows the percentage of yeast
cells positive for mouse serum from each immunization group (sequential, solid line; parallel, dashed line; core-only, dash-dotted line) for various time points
(mean ± SD).

(C) Schematic and data from ternary complex assay in which gp120 is pre-loaded onto VRC01 yeast then incubated with serum. The formation of a ternary
complex is only possible if serum antibodies are able to bind to their epitope(s) after VRC01 has bound. The figure shows the percentage of yeast cells positive for
mouse serum from each immunization group as above.

(D) Analysis of two monoclonal antibodies from a sequentially immunized mouse: fractional binding to D368R versus stripped core gp120 was assayed with 10 nM
monoclonal antibody (analogous to 6A); percentage binding in the ternary complex assay (analogous to 6C), in which gp120 is pre-bound to VRC01 yeast.



surface residues outside the CD4bs (Figure 5E), like the non- 
overlapping mutations in Ag variants in our calculations. 
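
The serum titration curves discussed above are fit to a monovalent binding isotherm. The short Python sketch below evaluates such a curve, assuming the isotherm has the form y = y_max · dil⁻¹ / (dil⁻¹ + K⁻¹) as read from the caption of Figure 5. The parameter values and the function name are hypothetical and serve only to show the shape of the curve; they are not fitted values from the experiments.

```python
def isotherm_signal(dil: float, y_max: float, k: float) -> float:
    """Binding signal (MFU) at serum dilution `dil` for the assumed isotherm."""
    inv_dil = 1.0 / dil  # relative serum concentration
    return y_max * inv_dil / (inv_dil + 1.0 / k)

# Hypothetical parameters for illustration only.
Y_MAX = 2000.0  # plateau signal in MFU
K = 100.0       # dilution at which the signal is half-maximal

for dil in (10, 100, 1000, 10000):
    print(f"1:{dil:<6d} -> {isotherm_signal(dil, Y_MAX, K):8.1f} MFU")
```

In this form, K is the dilution at which the signal reaches half of y_max, so similar K values across the four yeast-displayed variants would correspond to the similar binding seen for sequentially immunized mice.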

Serum specificity for the CD4bs was determined in two 
ways. First, we measured the binding of serum to the stripped 
core mutant D368R, which is known to disrupt the binding of 
several CD4bs-directed Abs (Thali et al., 1992). Serum samples 
collected after the third and fourth immunizations were incu- 
bated at a 1:100 dilution with yeast displaying either stripped 
core or the D368R mutant (Figure 6A). For mice from the sequen- 
tial immunization group, the fractional binding of serum to D368R 
was similar to that of VRC01 at saturating concentrations. Of the 
mice that were immunized with a cocktail of all four immunogens, 
three are very sensitive to D368R after three immunizations but 
appear to lose this specificity by the fourth administration. 

A second assay used to determine specificity was to evaluate 
the simultaneous binding of VRC01 and serum antibodies to 
gp120 (Experimental Procedures). The assay was performed
first by pre-incubating the gp120 with mouse serum before introducing
VRC01-displaying yeast (Figure 6B). If serum antibodies
compete with VRC01 for its binding site, the gp120-serum complex
will not bind to the yeast. No binding signal was observed for
any of the serum samples, suggesting that all contained some
fraction of VRC01-competitive antibodies. This result is consistent
with the D368R gp120 binding results discussed above.

The same assay was then performed by pre-incubating gp120
with VRC01 yeast before introducing the serum (Figure 6C) to
determine whether the serum contains any specificity other
than to the CD4bs. Note that due to plasmid loss in yeast during
culture, ~60% positive cells is the maximum observable signal in
the assay. Mice immunized with just stripped core (53%, 56%) or
with a cocktail of four Ags (mean = 32% ± 5%) developed antibodies
that bind to other epitopes on gp120. However, mice
immunized sequentially did not (mean = 1.0% ± 0.3%). These experiments
suggest that sequential immunization with Ag variants
elicited a serum response that was VRC01-competitive and
entirely focused on the conserved residues targeted by the
VRC01 Ab. Mice that were exposed to a cocktail of the variants
generated some VRC01-competitive antibodies, but also Abs
that targeted non-conserved residues on a single immunogen, 
i.e., were not cross-reactive. In other words, only sequential 



794 Cell 160 , 785-797, February 12, 2015 ©2015 Elsevier Inc. 







immunization robustly elicited a response that is focused on the 
conserved residues of the desired epitope. These findings are in 
harmony with our in silico results. 

To establish that individual CD4bs-competitive antibodies 
were generated by sequential immunization, hybridomas were 
generated by splenotypic fusion from a mouse immunized using 
this protocol. Two distinct gp120-binding monoclonal antibodies
were isolated (for sequences, see Extended Experimental Procedures).
The antibodies both targeted the CD4bs epitope, as
both were disrupted by the D368R mutation (at 10 nM antibody,
fractional binding to D368R versus stripped core gp120 of 1.4%
and 15.6%) and neither was able to bind to stripped core that 
had been pre-blocked with VRC01 (Figure 6D). These data 
make clear that the monoclonal antibodies are indeed focused 
on the CD4bs upon sequential immunization with the variant 
antigens. 

DISCUSSION 

The induction of bnAbs against highly mutable pathogens, such 
as HIV and influenza, will require immunization with Ag variants. 
Rational design of immunogens and efficient immunization 
protocols that elicit bnAbs requires mechanistic understanding 
of a basic problem in immunobiology: viz., how AM occurs in 
the presence of variant Ags. Our studies of AM in this setting 
have revealed new concepts, suggested new avenues for 
experimental and theoretical research, and have practical 
value. An important concept that emerges is that, when multi- 
ple Ag variants are used as immunogens, they can present 
conflicting selection forces during AM which frustrates the evo- 
lution of Abs. This phenomenon is most acutely manifested 
upon immunization with a cocktail of Ag variants. In this case, 
a B cell of a particular lineage that is selected in one round 
by a particular Ag variant is likely, after the next round of muta- 
tion, to encounter a different Ag variant on FDCs. This B cell is 
unlikely to be positively selected in this encounter, thus ending 
the evolution of a potentially favorable B cell lineage. The de- 
gree to which AM is frustrated is determined by a complex 
interplay of effects dependent on Ag concentration, how het- 
erogeneously Ag variants are displayed on FDCs, the number 
of mutations that separate the Ag variants, and the extent to 
which previously generated Abs migrate through ongoing 
GCs. Our results provide empirical evidence that these param- 
eters can be tuned to achieve an optimal level of frustration. 
Too much frustration prevents GC reactions from evolving 
favorable mutations that confer breadth. Too low a level of frus- 
tration (the extreme case being immunization with one Ag) re- 
sults in the development of strain-specific antibodies. An 
avenue for future theoretical research is to define a quantitative 
metric of frustration that describes precisely how the degree of 
frustration depends on the pertinent variables. The metric of 
frustration could then be optimized to design effective immuni- 
zation protocols. 

Our in silico results suggest that if one primes with the WT 
Ag, and then boosts with a cocktail of variants, bnAbs can 
emerge but only if some special conditions hold. Such a sce- 
nario may have been the driver of bnAb production in naturally 
infected patients because this immunization strategy mimics 



natural infection with one strain and subsequent viral diversifi- 
cation. However, our results show that deviations from these 
special conditions make the evolution of bnAbs unlikely in 
this setting (Table 1). This suggests that potent bnAbs evolve 
during natural infection only when special conditions are met, 
which happens rarely, and thus take a long time to emerge 
by chance. Controlling immunization protocols precisely to 
satisfy special conditions in every vaccinated individual seems 
difficult. Furthermore, even if the conditions were set to 
perfectly mimic a situation that resulted in the evolution of 
bnAbs upon natural infection, the outcome may not be favor- 
able because Darwinian evolution is inherently stochastic and 
cannot be replicated exactly. So, although it is important to un- 
derstand how bnAbs are produced during natural infection, 
mimicking these conditions may not be an efficient way to 
induce bnAbs by vaccination. 

We find that when Ag variants are administered sequentially, 
bnAbs can develop with relatively high probability over a wider 
range of conditions than if other strategies are deployed (Table 
1 and Figures 2, 3, and 4). In this case, the conflicting selection
forces due to the variant Ags are temporally separated. This can 
lead to successful evolution of multiple B cell lineages that first 
develop moderate interactions with the conserved residues of 
the epitope and then progressively acquire mutations that evade 
contacts with one set of variable residues and then another. 
Evolution of a set of possible clones which have a chance at 
maturing into bnAbs after AM against the first variant makes suc- 
cess more likely as all of them are unlikely to go extinct during 
stochastic mutation and selection upon immunizing with the sec- 
ond variant. Our in silico results predict that this immunization 
strategy is superior to immunization with a cocktail of the same 
set of variants, and our model experiments in mice support this 
prediction. 

Sequential immunization with Ag variants that share a single 
epitope to induce B cells specific for the common epitope 
has been tried with linear peptide epitopes derived from HIV 
gp41, though the resulting Abs were non-neutralizing (Guenaga
et al., 2011; Correia et al., 2010). Past work also emphasizes the
importance of masking irrelevant epitopes, but by itself, this 
strategy is insufficient for successful vaccination (Pantophlet 
et al., 2003; Selvarajah et al., 2005). Our in silico and in vivo 
studies suggest that precisely engineered variant intact virus 
spike immunogens administered in a sequential fashion under 
appropriate conditions may offer the best opportunities for in- 
duction of bnAbs against HIV-1 and other highly mutable 
pathogens. 



EXPERIMENTAL PROCEDURES 
In Silico Model 

The probability of a B cell internalizing antigen is given by: 

$$P_i = \frac{\sum_{j=1}^{n_A} C_j \, e^{(E_{ij} - E_a)/k_B T}}{1 + \sum_{j=1}^{n_A} C_j \, e^{(E_{ij} - E_a)/k_B T}} \qquad (1)$$



Here $C_j$ is the concentration of Ag of type j presented on the FDCs. The sum
over the Ag index j runs through $n_A$ distinct types of Ags that B cell i could
potentially interact with simultaneously during an encounter with a FDC
bearing multiple Ag variants.
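
As a rough illustration of the internalization probability, the following Python sketch evaluates a Langmuir-type expression of the form P = W / (1 + W), with W = Σ_j C_j exp((E_ij − E_a)/k_BT). This is our reading of Equation 1 and should be treated as an assumption. The function name, the use of energies in units of k_BT, and all numerical values are hypothetical.

```python
import math
from typing import Sequence

def p_internalize(conc: Sequence[float],
                  energies: Sequence[float],
                  e_a: float,
                  kbt: float = 1.0) -> float:
    """Probability that a B cell internalizes Ag (assumed form of Equation 1).

    conc[j]     -- concentration C_j of Ag type j on the FDC
    energies[j] -- interaction strength E_ij of this B cell with Ag type j
    e_a         -- reference energy E_a appearing in the exponent
    kbt         -- thermal energy k_B*T (energies given in the same units)
    """
    weight = sum(c * math.exp((e - e_a) / kbt)
                 for c, e in zip(conc, energies))
    return weight / (1.0 + weight)

# Hypothetical example: the same B cell seeing one variant vs. two variants
# at equal concentration and equal interaction strength.
one_variant = p_internalize([1.0], [0.0], e_a=0.0)
two_variants = p_internalize([1.0, 1.0], [0.0, 0.0], e_a=0.0)
print(one_variant, two_variants)
```

With equal concentrations, summing over two variants gives a larger probability than a single variant, in line with the observation that the optimal Ag concentration is lower when B cells can interact with all Ag variants.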
The probability that a B cell succeeds in receiving T cell help is given by:
where $\langle \cdot \rangle$ denotes the average probability of internalizing Ag over all the
other B cells present in the GC. $C_{tot} = \sum_j C_j$ is the total Ag concentration, and

the dependence on this variable accounts for the observation that the number 
of activated T helper cells increases with Ag dose, thus making it more likely 
that B cells receive T cell help. 

Binding Assay to Determine Competitiveness with VRC01

Yeast displaying the scFv of VRC01 was incubated with soluble stripped core
gp120 and an excess of mouse antiserum. The formation of a ternary complex
(a sandwich of VRC01, gp120, and mouse serum) was detected as
the presence of mouse antibodies on yeast by flow cytometry (% positive 
cells). The experiment was done in two ways as described in text. 

See Extended Experimental Procedures for additional information. 
| Gene | Name | Locus | Phenotype | Pathway/function |
| --- | --- | --- | --- | --- |
| MAPT | Microtubule-associated protein tau | 17q21.1 | FTD | Toxic aggregation (defect in neuronal cytoskeleton) |
| GRN | Granulin | 17q21.32 | FTD* | Autophagy; lysosomal pathway; inflammation |
| TREM2 | Triggering receptor expressed on myeloid cells 2 | 6p21.1 | FTD | Inflammation |
| CHMP2B | Charged multivesicular body protein 2B | 3p11.2 | FTD | Autophagy; lysosomal pathway |
| C9orf72 | Chromosome 9 open reading frame 72 | 9p21.2 | FTD, ALS | Toxic RNA or repeat dipeptides aggregation |
| SQSTM1 | Sequestosome 1 | 5q35 | FTD, ALS* | Autophagy |
| UBQLN2 | Ubiquilin 2 | Xp11.21 | FTD, ALS* | Autophagy |
| VCP | Valosin-containing protein | 9p13.3 | FTD, ALS* | Autophagy |
| OPTN | Optineurin | 10p13 | FTD/ALS* | Autophagy |
| SOD1 | Superoxide dismutase 1, soluble | 21q22.11 | ALS | Toxic aggregation; free radical scavenger enzyme |
| FUS | FUS RNA binding protein | 16p11.2 | ALS | DNA/RNA metabolism |
| TARDBP | TAR DNA binding protein | 1p36.22 | ALS | DNA/RNA metabolism |
| CHCHD10 | Coiled-coil-helix-coiled-coil-helix domain containing 10 | 22q11.23 | FTD, ALS | Mitochondrial function |
| ALS2 | Amyotrophic lateral sclerosis 2 (juvenile) | 2q33.1 | ALS* | Modulator for endosomal dynamics |
| SPG11 | Spastic paraplegia 11 (autosomal recessive) | 15q14 | ALS* | DNA damage repair |
| SETX | Senataxin | 9q34.13 | ALS* | DNA/RNA processing |
| MATR3 | Matrin 3 | 5q31.2 | ALS* | DNA/RNA metabolism |
| ANG | Angiogenin, ribonuclease, RNase A family, 5 | 14q11.1-q11.2 | ALS | Blood vessel formation |
| VAPB | VAMP (vesicle-associated membrane protein)-associated protein B and C | 20q13.33 | ALS* | Vesicle trafficking |
| PFN1 | Profilin 1 | 17p13.3 | ALS | Actin dynamics |
| TAF15 | TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 68kDa | 17q11.1-q11.2 | ALS | RNA metabolism |
| HNRNPA1 | Heterogeneous nuclear ribonucleoprotein A1 | 12q13.1 | ALS, FTD* | RNA metabolism; direct interaction with TDP-43 |
| HNRNPA2B1 | Heterogeneous nuclear ribonucleoprotein A2/B1 | 7p15 | ALS, FTD* | RNA metabolism; direct interaction with TDP-43 |
| ERBB4 | Erb-b2 receptor tyrosine kinase 4 | 2q33.3-q34 | ALS | Dysregulation of the neuregulin-ErbB4 pathway |
| ARHGEF28 | Rho guanine nucleotide exchange factor (GEF) 28 | 5q13.2 | ALS | Interaction with low-molecular-weight neurofilament mRNA |
| DAO | D-amino-acid oxidase | 12q24 | ALS | |
| GLE1 | GLE1 RNA export mediator | 9q34.11 | ALS | RNA metabolism |
| SIGMAR1 | Sigma non-opioid intracellular receptor 1 | 9p13.3 | ALS, FTD | Endoplasmic reticulum lipid rafts |
| ERLIN2 | ER lipid raft associated 2 | 8p11.2 | ALS* | Endoplasmic reticulum lipid rafts |
| PNPLA6 | Patatin-like phospholipase domain containing 6 | 19p13.2 | ALS* | Neurite outgrowth and process elongation |
| PRKAR1B | Protein kinase, cAMP-dependent, regulatory, type 1, beta | 7p22 | FTD* | Regulation of metabolism, ion transport, and gene transcription |
| DCTN1 | Dynactin 1 | 2p13 | ALS* | Vesicle trafficking |
| RISK OR MODIFIER GENES | | | | |
| TMEM106B | Transmembrane protein | 7p21.3 | FTD | Regulation of lysosomal function and progranulin pathways |
| ATXN2 | Ataxin 2 (susceptibility) | 12q24.1 | FTD/ALS | Modifier of TDP-43 toxicity |
| UNC13A/KCNN1 | Unc-13 homolog A (C. elegans)/potassium channel, calcium-activated intermediate/small conductance subfamily N alpha, member 1 | 19p13.11/19p13.1 | ALS | Neurotransmitter release/neuronal excitability |
| ZNF512B | Zinc finger protein 512B | 20q13.33 | ALS | |
Frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS) are considered to be part of a spectrum. Clinically, FTD patients present with dementia frequently characterized
by behavioral and speech problems. ALS patients exhibit alterations of voluntary movements caused by degeneration of motor neurons. Both syndromes can be present
within the same family or even in the same person. The genetic findings for both diseases also support the existence of a continuum, with mutations in the same genes being
found in patients with FTD, ALS, or FTD/ALS.

The figure on the left represents the distribution of genes according to their associated phenotype and mutation frequency in familial cases (this can vary between cohorts 
or populations). 

Some cases of mutations in genes represented in the extremes of the graph (associated with either ALS or FTD) have also been described as associated with the other clinical 
phenotypes. For example, mutations in TARDBP that are the cause of ALS have, on rare occasions, been described in cases presenting with FTD. The genetic evidence for these
associations is weak (mainly lacking segregation of the variants with the phenotypes or strong evidence of pathogenicity), and for these reasons, such genes are represented 
as the extremes of the FTD-ALS continuum. 

Genetic ALS is usually not only associated with motor systems degeneration but also, very frequently, occurs in combination with degenerative processes like parkinsonism 
or ataxia. In fact, very few ALS genes have been associated with a strict motor-only phenotype, and genes commonly associated with other diseases have sometimes been 
found to be the cause of ALS. This was the case for FIG4 and SPAST, which were originally associated with Charcot-Marie-Tooth neuropathy type 4J (CMT4J) and
autosomal-dominant spastic paraplegia-4 (SPG4), respectively, and were subsequently found to be mutated in ALS cases.

These overlaps between different phenotypes and genes found to be mutated either in ALS with additional features or in other diseases and ALS are represented in the right panel. 

One FTD gene (PRKAR1B) and several ALS genes (ARHGEF28, for example) have been, so far, only reported in a single family (with different levels of segregation) or have not yet
been independently replicated. PRKAR1B was identified in an FTD-like family with a syndrome considered to be novel. The family presented dementia and/or parkinsonism in 12
affected individuals and a unique neuropathology displaying abundant neuronal inclusions by hematoxylin and eosin staining throughout the brain with immunoreactivity for intermediate
filaments. Some ALS genes have conflicting results with variants described in controls and uncertain pathogenicity attributed to different mutations (ANG, for example).

Several genes implicated in FTD and ALS have possible functions within the same biological pathways, such as autophagy and the lysosomal pathway, maintenance of the
neuronal cytoskeleton, and DNA/RNA metabolism. In some cases, genes also share similar domains, like the RNA-binding genes with prion-like domains HNRNPA1, HNRNPA2B1,
TARDBP, FUS, TAF15, and EWSR1. In fact, TAF15 and EWSR1 were screened for variants because of their commonalities with FUS and TARDBP. In both
genes, variants were found in patients with sporadic ALS, and weak familial segregation has only been demonstrated for TAF15.

The use of next-generation sequencing technologies has recently allowed rare variants in TUBA4A to be associated with ALS, and analyses of de novo
mutations in sporadic ALS trios have identified mutations in SS18L1. Again, the approaches used did not allow for segregation analyses and, for this reason, even though the
variants were associated with familial ALS, these genes are not included as Mendelian genes for ALS in the table.

Other genes have been reported as susceptibility genes or modifiers of FTD and/or ALS with different levels of evidence. These include, in addition to the ones represented in the
table, BTNL2/HLA-DRA/HLA-DRB5, RAB38/CTSC, CHRNA3/4, CHRNB4, PRPH, CHGB, NEFH, VEGF, HFE, PON1/2/3, DPP6, ELP3, ITPR2, FGGY, DPYSL3, KIFAP3,
EPHA3/4, PPARGC1A, APOE, MAOB, CX3CR1, and SMN1.

The loci identified by genome-wide association studies are named according to the closest gene to the most significant hit at each locus. It is possible that the gene involved 
in FTD and/or ALS is not the closest but is in linkage disequilibrium with the true hit. This has been recently shown to be the case for UNC13A, where KCNN1 (a nearby gene) 
was implicated by eQTL analysis. 

An asterisk (*) indicates genes for which mutations may present additional phenotypes or be the cause of different clinical syndromes. For example, heterozygous loss-of-function
mutations in GRN cause FTD, whereas homozygous mutations in the same gene have recently been reported to cause neuronal ceroid lipofuscinosis-11 (CLN11); VCP
mutations are known to cause ALS with or without FTD and inclusion body myopathy with Paget disease of bone and frontotemporal dementia (IBMPFD).

ABBREVIATIONS 

CMT neuropathy, Charcot-Marie-Tooth neuropathy; POAG, primary open angle glaucoma; SMA/PMA, spinal muscular atrophy/progressive muscular atrophy; AOA2, Ataxia with 
oculomotor apraxia type 2. 